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American Political Science Review 


Notes from the Editor 


ANNOUNCEMENT 


The inaugural issue of the American Political Science Review was published in November of 1906. 
To mark the centennial of the APSR, the November 2006 issue will feature a special section devoted 
to considerations of the evolution of political science. This special section will be an extra feature, 
above and beyond the regular complement of research articles. 

The APSR is actively soliciting submissions on the broad theme of the special section. 

If you are interested in submitting a paper, please contact the editor of the APSR at apsr@gwu.edu 
to express your interest and to provide a brief description of the paper you would like to submit. Also, 
please bring this solicitation to the attention of others who may be interested and encourage them to 
contact the editor. 

Like all other papers submitted to the APSR, submissions received in response to this solicitation 
will undergo peer review. Overseeing this process will be the editor of the APSR, Lee Sigelman, and 
a member of the APSR’s editorial board, M. Elizabeth Sanders of Cornell University. 

To be considered for publication, a paper must be no more than 15-17 pages in length, conven- 
tionally formatted (e.g., double-spaced throughout, including notes and references, with margins of 
at least one inch on all sides and set up in at least an 11-point font size). 

In light of the length constraints, authors are advised to address a specific theme rather than 
aiming at a broader, synoptic disciplinary overview. Pertinent examples would include: an analysis 
of a particular turning point in the evolution of the discipline or of the role played by a particular 
individual; a treatment of the evolution of some influential school of thought; a comparison of the 
evolution of two subfields of political science, or of political science in the U.S. and another area, Or 
of political science and another discipline; or a consideration of how a certain type of scholarship has 
influenced and/or been influenced by public policy, or the relationship between political science and 
the state. (These are offered only as examples, rather than as a definitive set of topics to be addressed.) 

To be considered for publication, submissions must reach the APSR office by no later than January 
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2, 2006, but earlier submission is encouraged. 


Inside this Issue 


Relationships ranging from the interpersonal to the in- 
ternational are built on trust. A prerequisite for friend- 
ship and cooperation, trust involves a sense of reliance 
on or confidence in another, especially when one is 
vulnerable. Trust is built up over time by repeated 
confirming instances of support, but it can erode over 
time due to disappointing interactions, and it can even 
be erased completely in a single act of treachery. Our 
cover picture of trapeze artists in midair points to the 
amazing feats that can be accomplished when trust is 
earned and maintained, yet it also hints at the devastat- 
ing consequences that can arise when trust is misplaced. 
These themes run through the articles in this issue of 
APSR. 

From smirks and sighs during presidential debates 
to hard-hitting attack ads, campaigning for political 
office, always a contact sport, has increasingly been 
plagued by “unnecessary roughness” in recent years. In 
“The New Videomalaise: Effects of Televised Incivility 
on Political Trust,” Diana C. Mutz and Byron Reeves 
examine the impact of disparaging political discourse. 
Mutz and Reeves build on an extensive body of 
scholarship that has assumed or hypothesized, but has 
yet to convincingly demonstrate, a direct relationship 
between viewing televised campaign coverage and 
holding “negative” political attitudes. Based on an 





innovative research design that involves, among other 
elements, a mock congressional campaign debate per- 
formed by professional actors and readouts from skin 
conductance electrodes, Mutz and Reeves establish a 
direct link between specific types of televised political 
discourse and levels of political trust. These findings 
yield surprising insights about the malleability of po- 
litical trust and suggest anew that in politics, what you 
say may be less important than how you say it. 

The potential undermining of trust during an elec- 
tion cycle is not necessarily limited to the campaign. 
According to Mark Andreas Kayser, the costs of an 
election extend far beyond what a tally sheet of cam- 
paign expenditures and operating costs might suggest. 
In “Who Surfs, Who Manipulates? The Determinants 
of Opportunistic Election Timing and Electorally Mo- 
tivated Economic Intervention,” Kayser explores the 
relationship between elections and politically moti- 
vated economic intervention. Governments with the 
ability to call elections within a broad time period, he 
suggests, are less likely to manipulate the economy for 
electoral purposes. Assuming that politically motivated 
economic interventions constitute an inefficient use of 
budgetary resources, their elimination would lower the 
incidental costs of elections. This analysis is a must- 
read, especially for those in fixed-election democracies. 
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Of course, playing fast and loose with the rules is not 
confined to domestic politics. In an era of increasing 
globalization, states are no longer the only actors that 
need to be held accountable. In “Accountability and 
the Abuse of Power in World Politics,” Ruth W. Grant 
and Robert O. Keohane go beyond the simple assertion 
that the accountability processes at work within demo- 
cratic states can be transferred to the international sys- 
tem. Instead, they offer a typology of accountability 
mechanisms for different international actors, includ- 
ing states, as well as intergovernmental institutions, 
transnational networks, and nongovernmental organi- 
zations. This typology not only advances prior theo- 
retical and conceptual understandings of accountabil- 
ity in its international context, but also provides both 
a checklist for scrutinizing international actors and a 
road map for international actors that are attentive to 
accountability concerns. 

Responsible global politics requires not only mech- 
anisms of accountability, but something of a common 
identity, or at least a modicum of trust between interna- 
tional actors. Continuing in the same line of scholarship 
that was evidenced in his September 2002 APSR arti- 
cle, “Does Liberal Democracy Presuppose a Cultural 
Nation? Four Arguments,” Arash Abizadeh questions 
the widely held supposition that in-group solidarity 
requires the presence of an out-group against which 
to frame group identity. Such a belief would make it 
impossible to create an all-encompassing global iden- 
tity, which would, by definition, lack the requisite out- 
group. However, Abizadeh argues, in “Does Collec- 
tive Identity Presuppose an Other? On the Alleged 
Incoherence of Global Solidarity,” that this conception 
of identity is fundamentally flawed. Although many 
obstacles to the creation of a global identity remain, 
Abizadeh removes one significant point of misunder- 
standing from its path. 

The politics of us-versus-them is also relevant to 
answering questions regarding domestic politics. For 
example, why do we often see social democratic par- 
ties acting in opposition to their own self-interests? 
In “Insider-Outsider Politics in Industrialized Democ- 
racies: The Challenge to Social Democratic Parties,” 
David Rueda argues that that is the wrong question 
because it starts from an incorrect premise. Social 
democratic parties are not cohesive advocates of labor 
but, instead, are composed of both labor “insiders” 
and “outsiders.” This heterogeneity results in divergent 
preferences, By disaggregating social democratic par- 
ties into their constituent units, Rueda sets the stage for 
a new generation of research on this important topic. 

Insiders and outsiders are similarly found in Amer- 
ican politics and history. In their provocative article, 
“Racial Orders in American Political Development,” 
Desmond S. King and Rogers M. Smith contend that 
a framework of racial orders is a critical component of 
any study of American politics, even for topics that are 
not prima facie race-related. From institutional organi- 
zation to policy preferences, analyses that fail to take 
race seriously are lacking an important explanatory 
variable. At a bare minimum, scholars must justify the 
exclusion of race from their study, although King and 
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Smith advocate a more thorough treatment of the issue 
and urge practitioners to shine a spotlight on this issue. 

With so much scholarly attention devoted to the ex- 
ecutive or cabinet in a parliamentary government, one 
might be forgiven for forgetting that a parliament is first 
and foremost a legislative body. Lanny W. Martin and 
Georg Vanberg remind us of this fact in “Coalition Pol- 
icymaking and Legislative Review.” Going beyond the 
traditional focus on government formation and dissolu- 
tion, Martin and Vanberg examine the tools available 
to governing parties, emphasizing the pivotal role of 
legislative review in coalition governments. Their thor- 
ough theoretical and empirical analysis demonstrates 
that legislators have a meaningful role to play in the 
policymaking process. 

The next article in this issue also investigates who has 
meaningful decision-making roles, this time from a for- 
eign policy perspective. Why did the United States in- 
vade Iraq? Why does the United States support Israel? 
Identifying the sources of influence on such weighty 
decisions is the focus of Lawrence R. Jacobs and 
Benjamin I. Page’s “Who Influences U.S. Foreign Pol- 
icy?” Previous research has identified organized gro- 
ups, epistemic communities, and public opinion as lead- 
ing domestic influences on foreign policy. Jacobs and 
Page present an innovative comparative test of these 
three sources across different issue areas and institu- 
tional settings. Their findings, which dispute commonly 
held beliefs among foreign policy analysts, suggest a 
major rethinking of American foreign policymaking. 

Important decisions are made not only by superpow- 
ers with mighty armies, but also by individual citizens 
with meager assets. One perennial decision-making 
question is whether welfare recipients shop around for 
the most generous states in which to live. Political sci- 
entists have reached conflicting conclusions in trying 
to answer this important public policy question. No 
state wants to be known as a welfare magnet, with the 
attendant spiral of increasing demand and decreasing 
revenues, nor do most states want to be exceedingly 
callous toward the less fortunate, so the answer to this 
question has significant implications for policy makers. 
Michael A. Bailey assesses past research and provides 
new insights in “Welfare and the Multifaceted Deci- 
sion to Move.” Augmenting previous modeling of the 
welfare migration question with a more realistic and 
complete explanation of the considerations involved in 
moving across state lines, Bailey not only finds a modest 
but significant relationship between benefit levels and 
residency decisions, but also discovers that when peo- 
ple decide to move, there is truly no place like home. 

Finally, we revisit Ethan Putterman’s August 2003 
APSR article, “Rousseau on Agenda-Setting and Ma- 
jority Rule.” Controversial in death as he was in life,. 
Rousseau has inspired generations of scholarship and 
debate. Two political theorists of more recent vintage 
continue the discussion in our “Forum” section. First, 
John T. Scott contends that Putterman has misinter- 
preted Rousseau’s argument about representation and 
leadership. That is an important charge in itself, but 
more is at stake here than the correct interpretation 
of Rousseau’s text, for this question brings into play 
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greater issues concerning both the essence and the in- 
tegrity of the democratic process. Putterman’s forceful 
response to Scott should leave readers assured that this 
debate is far from over. 


INSTRUCTIONS TO CONTRIBUTORS 


General Considerations 


The APSR strives to publish scholarly research of 
exceptional merit, focusing on important issues and 
demonstrating the highest standards of excellence 
in conceptualization, exposition, methodology, and 
craftsmanship. Because the APSR reaches a diverse 
audience of scholars and practitioners, authors must 
demonstrate how their analysis illuminates a significant 
research problem, or answers an important research 
question, of general interest in political science. For the 
same reason, authors must strive for a presentation that 
will be understandable to as many scholars as possible, 
consistent with the nature of their material. 

The APSR publishes original work. Therefore, au- 
thors should not submit articles containing tables, 
figures, or substantial amounts of text that have al- 
ready been published or are forthcoming in other 
places, or that have been included in other manuscripts 
submitted for review to book publishers or periodicals 
(including on-line journals). In many such cases, sub- 
sequent publication of this material would violate the 
copyright of the other publisher. The APSR also does 
not consider papers that are currently under review 
by other journals or duplicate or overlap with parts of 
larger manuscripts that have been submitted to other 
publishers (including publishers of both books and 
periodicals). Submission of manuscripts substantially 
similar to those submitted or published elsewhere, or 
as part of a book or other larger work, is also strongly 
discouraged. If you have any questions about whether 
these policies apply in your particular case, you should 
discuss any such publications related to a submission in 
a cover letter to the Editor. You should also notify the 
Editor of any related submissions to other publishers, 
whether for book or periodical publication, that occur 
while a manuscript is under review by the APSR and 
which would fall within the scope of this policy. The 
Editor may request copies of related publications. 

If your manuscript contains quantitative evidence 
and analysis, you should describe your procedures 
in sufficient detail to permit reviewers to understand 
and evaluate what has been done and, in the event 
that the article is accepted for publication, to per- 
mit other scholars to carry out similar analyses on 
. other data sets. For example, for surveys, at the least, 
sampling procedures, response rates, and question 
wordings should be given; you should calculate re- 
sponse rates according to one of the standard formulas 
given by the American Association for Public Opinion 
Research, Standard Definitions: Final Dispositions of 
Case Codes and Outcome Rates for Surveys (Ann 
Arbor, MI: AAPOR, 2000). This document is available 
on the Internet at <http://www.aapor.org/default.asp? 
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page = survey_methods/standards_and_best_practices/ 
standard_definitions>. For experiments, provide full 
descriptions of experimental protocols, methods of 
subject recruitment and selection, subject payments 
and debriefing procedures, and so on. Articles should 
be self-contained, so you should not simply refer read- 
ers to other publications for descriptions of these basic 
research procedures. 

Please indicate variables included in statistical anal- 
yses by capitalizing the first letter in the variable 
name and italicizing the entire variable name the first 
time each is mentioned in the text. You should also use 
the same names for variables in text and tables and, 
wherever possible, should avoid the use of acronyms 
and computer abbreviations when discussing variables 
in the text. All variables appearing in tables should 
have been mentioned in the text and the reason for 
their inclusion discussed. 

As part of the review process, you may be asked 
to submit additional documentation if procedures are 
not sufficiently clear; the review process works most 
efficiently if such information is given in the initial 
submission. If you advise readers that additional infor- 
mation is available, you should submit printed copies 
of that information with the manuscript. If the amount 
of this supplementary information is extensive, please 
inquire about alternate procedures. 

The APSR uses a double-blind review process. You 
should follow the guidelines for preparing anonymous 
copies in the Specific Procedures section below. 

Manuscripts that are largely or entirely critiques or 
commentaries on previously published APSR articles 
will be reviewed using the same general procedures as 
for other manuscripts, with one exception. In addition 
to the usual number of reviewers, such manuscripts will 
also be sent to the scholar(s) whose work is being crit- 
icized, in the same anonymous form that they are sent 
to reviewers, Comments from the original author(s) to 
the Editor will be invited as a supplement to the advice 
of reviewers, This notice to the original author(s) is 
intended (1) to encourage review of the details of 
analyses or research procedures that might escape 
the notice of disinterested reviewers; (2) to enable 
prompt publication of critiques by supplying criticized 
authors with early notice of their existence and, there- 
fore, more adequate time to reply; and (3) as a courtesy 
to criticized authors. If you submit such a manuscript, 
you should therefore send as many additional copies of 
their manuscripts as will be required for this pu 

Manuscripts being submitted for publication should 
be sent to Lee Sigelman, Editor, American Politi- 
cal Science Review, Department of Political Science, 
The George Washington University, Washington, DC 
20052. Correspondence concerning manuscripts under 
review may be sent to the same address or e-mailed to 


apst@pwu.edu. 


Manuscript Formatting 


Manuscripts should not be longer than 45 pages in- 
cluding text, all tables and figures, notes, references, 
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and appendices. This page size guideline is based on the 
U.S. standard 8.5 x 11-inch paper; if you are submitting 
a manuscript printed on longer paper, you must adjust 
accordingly. The font size must be at least 11 points for 
all parts of the paper, including notes and references. 
The entire paper, including notes and references, must 
be double-spaced, with the sole exception of tables 
for which double-spacing would require a second page 
otherwise not needed. All pages should be numbered in 
one sequence, and text should be formatted using a nor- 
mal single column no wider than 6.5 inches, as is typical 
for manuscripts (rather than the double-column format 
of the published version of the APSR), and printed on 
one side of the page only. Include an abstract of no 
more than 150 words. The APSR style of embedded 
citations should be used, and there must be a sepa- 
rate list of references at the end of the manuscript. 
Do not use notes for simple citations. These specifi- 
cations are designed to make it easier for reviewers 
to read and evaluate papers. Papers not adhering to 
these guidelines are subject to being rejected without 
review. 

For submission and review purposes, you may place 
footnotes at the bottom of the pages instead of using 
endnotes, and you may locate tables and figures (on 
separate pages and only one to a page) approximately 
where they fall in the text. However, manuscripts ac- 
cepted for publication must be submitted with end- 
notes, and with tables and figures on separate pages at 
the back of the manuscript with standard indications of 
text placement, e.g., [Table 3 about here]. In deciding 
how to format your initial submission, please consider 
the necessity of making these changes if your paper 
is accepted. If your paper is accepted for publication, 
you will also be required to submit camera-ready copy 
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The New Videomalaise: Effects of Televised Incivility 


on Political Trust 


DIANA C. MUTZ University of Pennsylvania 


BYRON REEVES Stanford University 


oes incivility in political discourse have adverse effects on public regard for politics? If so, why? 

In this study we present a theory suggesting that when viewers are exposed to televised political 

disagreement, it often violates well-established face-to-face social norms for the polite expression 
of opposing views. AS a result, incivility in public discourse adversely affects trust in government. Drawing 
on three laboratory experiments, we find that televised presentations of political differences of opinion 
do not, in and of themselves, harm attitudes toward politics and politicians. However, politcal trust is 
adversely affected by levels of incivility in these exchanges. Our findings suggest that the format of much 
political television effectively promotes viewer interest, but at the expense of political trust. 


oes incivility in political discourse affect pub- 

lic regard for politics? Or is a certain amount 

of rancorous debate part and parcel of what 
citizens expect from political actors? Is watching 
politicians and pundits hurl insults at one another on 
television merely a harmless pastime, or does it have 
consequences for how people think about politics and 
government? In particular, does televised political in- 
civility harm levels of trust in government and politi- 
clans? 

There is now widespread concern in the United 
States about a “civility crisis” in public life. As Rodin 
(1996) suggests, “Across America and increasingly 
around the world, from campuses to the halls of 
Congress, to talk radio and network TY, social and 
political life seem dominated today by incivility....No 
one seems to question the premise that political de- 
bate has become too extreme, too confrontational, too 
coarse.” Calls for greater civility in political discourse 
have come from a wide array of scholars, as well as 
from philanthropic organizations that see incivility as 
a threat to the functioning of democracy.' But to date, 
there has been little effort to confirm empirically the 
negative consequences of incivility. 

A number of scholars suggest that incivility in 
political discourse is one of the key reasons why 
Americans tend to be negative toward politicians and 
political institutions. According to this line of thought, 
political conflict is seen as unnecessary and distaste- 
ful. As Hibbing and Theiss-Morse (1995, 147) explain, 
“Citizens... dislike being exposed to processes en- 
demic to democratic government. People do not wish 
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to see uncertainty, conflicting options, long debate, 
competing interests, confusion, bargaining, and com- 
promised, imperfect solutions.” Durr, Gilmour, and 
Wolbrecht (1997) similarly suggest that it is precisely 
when Congress is doing its job, debating and ultimately 
resolving controversial political issues, that public re- 
gard for this institution declines. 

Others argue more narrowly, suggesting that it is not 
conflict per se, but the incivility with which political 
elites disagree that prompts negative attitudes. Uslaner 
(1993), for example, has suggested that members of 
Congress are increasingly likely to violate norms of 
politeness in their discourse, and Tannen (1998) char- 
acterizes the United States as “a culture of argument” 
that encourages “a pervasive warlike atmosphere.” To 
the casual observer, politicians seem to be perpetually 
involved in bitter conflict. If political conflict is aired 
openly in an uncivil fashion, can citizens be expécted 
to maintain respect for politics and politicians? 

Televised portrayals of political conflict have re- 
ceived a particularly severe beating with respect to 
levels of civility. Elving (1994), for example, cites 
television coverage of congressional floor debate as 
an important source of dissatisfaction with Congress. 
Cappella and Jamieson (1997) point to media reports 
that highlight conflict in politics as a source of greater 
political cynicism. The increasing visibility of politi- 
cal conflict through television seems indisputable (see 
Funk 2001), and even journalists concur that in Ameri- 
can politics, “hyperbole and venomous invective are 
common talk.”* Viewers’ sense that politicians are 
engaged in pointless bickering is assumed to be fed 
by media coverage emphasizing the intensity of con- 
flict whenever possible (see also, McGraw, Willey, and 
Anderson 1998). 

It is unclear whether elite political discourse is really 
any less civil than in the days when duels were occa- 
sionally used to resolve differences of opinion (see, 
e.g., Altschuler and Blumin 2000 and Sapiro 1999). 
However, the dominance of television as a source of 
exposure to politics suggests that, at the very least, 
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the extent of mass public exposure to uncivil political 
discourse probably has increased. It is one thing to read 
about political pundits’ or candidates’ contrary views 
in the press, and quite another to witness them directly 
engaged in vituperative argument “in person.” 

More general theories suggesting that television 
bears some responsibility for negative attitudes toward 
politics and politicians have received enthusiastic re- 
ceptions over the years. For some, the root of the prob- 
lem is the cynicism of game-centered political cover- 
age and journalists’ ongoing denigration of politicians’ 
motives; for others, it is simply the conflict-oriented, 
adversarial nature of political coverage. The timing of 
the well-documented decline in trust toward govern- 
mental institutions initially gave these theories great 
plausibility, but documenting a causal link between 
political television and negative public attitudes has 
proven quite difficult. This difficulty stems from a lack 
of certainty about which aspects of political television 
are most likely to produce negative attitudes, and from 
problems inherent in studying media effects. 


TELEVISION AND POLITICAL TRUST 


In the 1970s, Robinson (1975) popularized the term 
“videomalaise” to refer to negative public attitudes 
that resulted from watching television news. Evidence 
in support of the original videomalaise claim was based 
on a quasi-experimental study of effects from view- 
ing one particular television program. Jn a subsequent 
study (Robinson and Appel 1979), content analyses of 
the three major network news programs showed that 
negative coverage predominated. Although evidence 
of effects remained thin thereafter, an overview of re- 
search on television and politics in the early 80s echoed 
the popularity of this thesis, concluding that political 
television “has altered the culture significantly by in- 
tensifying ordinary Americans’ traditional low opinion 
of politics and politicians, by exacerbating the decline 
in their trust and confidence in their government and 
its institutions” (Ranney 1983, 86). 

Despite the widespread belief that television has 
something to do with low levels of political trust, ev- 
idence supporting this causal claim has been limited. 
In addition, current theories linking trust to political 
media have little to do with television per se. Instead, 
theories about videomalaise have broadened into more 
general claims about political journalism, claims that 
transcend television, newspapers, and virtually all po- 
litical media. Patterson (1993), for example, blames 
negative news coverage from all media for unfavorable 
attitudes toward politicians. He suggests that negative 
commentary from journalists naturally leads members 
of the public to think ill of politicians and the sys- 
tem in which they are embedded (see also Miller, 
Goldenberg, and Erbring 1979). In addition, journalis- 
tic narratives emphasizing the ulterior, self-interested 
motives of political actors have been widely blamed for 


3 Even when candidates “talk past” one another m their discourse, 
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public negativity. If the strategic, “inside dopester” per- 
spective predominates, then the public’s attitudes will 
increasingly reflect this same cynicism (see Cappella 
and Jamieson 1997). 

The shift toward more general theories of “media- 
malaise” does not mean that television has been given 
a complete reprieve from its responsibility for all that 
ails American politics. Instead, the argument evolved 
to blame television journalism for initiating the dele- 
terious shifts in tone and content that subsequently 
spread to other political media. Because of the need to 
compete with television, print media began to mimic 
television, at least so the argument goes (cf Sigelman 
and Bullock 1991). Whatever the rationale, few schol- 
ars now contend that political television, as it currently 
exists, presents a unique perspective on government 
and politics. 

The few exceptions to this generalization are studies 
suggesting that television, by virtue of its visual nature, 
draws attention to certain dimensions of candidate 
evaluation over others. As one journalist summarized 
it in the early 70s, “Television makes the candidate of 
today a human being at one’s elbow, who is going to 
be judged on the same terms as a man greets any new 
acquaintance” (Gould, [1952] 1972, 21). The sensory 
realism of television conveys a sense of intimacy with 
political actors that citizens were unlikely to encounter 
in the past, even in face-to-face meetings with politi- 
cians (Hart 1994). 

At this point, scholars have only begun to study the 
consequences of the up-close and personal intimacy 
that viewers have with political actors on television. 
The main emphasis thus far in the research has been 
on the extent to which televised politics heightens the 
importance of personality characteristics in general 
evaluations of candidates. Keeter (1987), for example, 
found that candidates’ personal qualities were more 
important to voters who obtained political news from 
television. More recently, Druckman (2003) also found 
that television primes people to rely more on person- 
ality perceptions when evaluating candidates. But we 
know little about how citizens react to uncivil political 
discourse when it is up-close and personal on television. 

Would trust in politics and politicians improve if the 
public simply did not witness so much uncivil political 
disagreement on television? Drawing on social psycho- 
logical theories of human-—media interaction, we sug- 
gest that televised political disagreements exacerbate 
the “intensification of feeling” that Walter Lippmann 
(1925) so despised in politics. When political actors en- 
gage in televised interactions that violate the norms for 
everyday, face-to-face discourse, they reaffirm viewers’ 
sense that politicians cannot be counted on to obey 
the same norms for social behavior by which ordinary 
citizens abide. 

Without disputing previous evidence about televi- 
sion’s influence on political trust, this study offers a 
different rationale for why incivility may intensify neg- 
ative attitudes toward government and politicians. We 
examine the impact of televised examples of uncivil po- 
litical discourse, not because incivility is limited to tele- 
vision but, rather, because television is the dominant 
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medium through which citizens are exposed to political 
controversy, and because its images and sounds more 
completely mimic conflict in real interpersonal situa- 
tions. Funk (2001), for example, found that incivility 
among members of Congress conveyed through print 
produced findings in the direction one would expect, 
but they did not reach statistical significance—a null 
finding she predicts would change using televised vari- 
ations in civility. Our goal in this study was to explore 
reactions to political television that conveys civility and 
incivility in a manner more similar to face-to-face inter- 
action, that is, with a rich array of cues tied to language, 
speech, and expression. 


A THEORETICAL BASIS FOR 
VIDEOMALAISE 


To trust is to assume that a person or institution will 
“observe the rules of the game” (Citrin and Muste 1999, 
465) and to believe that those involved will act “as they 
should” (Barber 1983). But what does that imply in the 
context of televised political disagreement? We pro- 
pose this means that people expect political actors who 
appear on television to abide by the same social norms 
acknowledged by ordinary Americans. To be sure, the 
actual norms for behavior on television are very differ- 
ent from what they are in the world of everyday social 
interaction. But we hypothesize that when political ac- 
tors violate interpersonal social norms on television, 
viewers react as they would if they were witnessing 
the same interaction in real life. In other words, their 
emotional reactions are not mediated by the cognitive 
acknowledgement that this is “only television,” and 
thus they react quite negatively to incivility.* Below we 
review past evidence that supports this idea, beginning 
with what we know about reactions to incivility and 
disagreement in real life. 

In face-to-face settings where people disagree about 
politics, there are strong social norms likely to be ob- 
served for purposes of these interactions. Face-to-face 
exchanges are relatively polite. Although people occa- 
sionally yell at one another and stomp their feet over 
political differences, such behavior is far more common 
in mediated presentations of political views. Norms in- 
volving politeness are extremely strong (Brown and 
Levinson 1987); most people are polite most of the 
time. 

The experience of political conflict on television 
often differs substantially from real life. Increased 
market competition has encouraged political shows 
to “liven themselves up” in order to increase au- 
dience size (Fallows 1996). Programs such as the 
McLaughlin Group, The O’Reilly Factor, Meet the 
` Press, Crossfire, Capital Gang, and Hardball depict par- 
ticularly intense and heated exchanges. Even standard 
news programs are increasingly characterized by an 


* The hypothesis that television 1s able to elicit a greater intensity 
of emotional reactions has been around for some time, but it has 
met with mixed success in empirical studies (see Hibbmg and Theiss- 
Morse 1998 for a renew). This study addresses a narrower, more 
specific hypothesis and proposes a specific mechanism of mfluence 
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emphasis on controversy and contentiousness. Civil 
and polite exchanges of opinion do occur on televi- 
sion, and screaming sometimes occurs in interpersonal 
discussions surrounding politics, but the central ten- 
dency in media is to highlight emotionally extreme and 
impolite expressions, whereas the central tendency in 
face-to-face communication is toward polite and emo- 
tionally controlled interactions. Consistent with this 
evidence, Hibbing and Theiss-Morse (1998) find that 
people who obtain news from television or radio gener- 
ate significantly more negative emotional evaluations 
of Congress, though their cognitive evaluations are in- 
distinguishable. 

Polite manners and other pleasantries may seem ex- 
traneous to political trust, but the need for politeness is 
particularly great when expressing controversial views. 
As Kingwell (1995) suggests, politeness and civility are 
not arbitrary norms akin to using the correct fork; they 
are a means of demonstrating mutual respect. Empiri- 
cal studies substantiate the importance of adhering to 
social norms of politeness. Tyler (1990), for example, 
finds that politeness and respect toward individuals in- 
volved in a legal conflict enhances their perceptions of 
fair treatment. 

In addition to being less civil, the television world 
also provides a uniquely intimate perspective on 
conflicts, Interestingly, in the literature on human prox- 
emics, the distance deemed appropriate for face-to- 
face interactions with public figures in American cul- 
ture is beyond 12 feet (see Aiello 1987), yet most 
citizens’ exposure to politicians via television has the 
appearance of being far closer. When people argue, it is 
typically unpleasant, and the tendency is to back off, es- 
pecially if one is personally involved in the argument. In 
contrast, as televised political conflicts intensify, cam- 
eras Close in with tighter and tighter perspectives on 
the people involved. This creates a highly unnatural 
experience for viewers, one in which they view conflict 
from an extremely intimate perspective, and one that 
would be highly unlikely to occur in the real world. 
When social norms for civility are violated on televi- 
sion, the viewer’s intimate perspective intensifies an 
already negative reaction to incivility.° 

Of course, this hypothesis rests on the assumption 
that watching conflict on television is much the same 
experience for citizens as when conflict is face-to-face. 
While this may seem unlikely, an accumulation of evi- 
dence suggests otherwise. Reeves and Nass (1996, 13), 
for example, have documented responses to mediated 
images that parallel those found in the natural world. 
For example, when motion appears on a television 
screen, physiological responses are the same as when 
motion occurs in the immediate environment. When 
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viewers see a picture of a person on a screen who 
appears to be closer by virtue of his or her size on 
the screen, evaluations of the person on the screen 
are more intense, just as they are when real people 
come closer (see also Lombard 1995 and Lombard 
et al. 2000). Based on these experiments, Reeves and 
Nass (1996) conclude that the human brain has not 
evolved quickly enough to respond in rational ways to 
technologies such as television. As they put it, “People 
expect media to obey a wide range of social and natural 
rules. All these rules come from the world of world of 
interpersonal interaction.” 

Television and film, more than newspapers or radio, 
provide an approximation of human experience in 
terms of visual and aural sensory input. Moreover, re- 
search suggests that people’s default reactions to televi- 
sion tend to be primitive and automatic. The prospects 
for unthinking and automatic social responses are 
preatest with television because it more closely repli- 
cates human experience (see Lang 2000). The visual 
element of television encourages “gut reactions” on 
the part of viewers, which are not mediated by cog- 
nitive assessments (see Sullivan and Masters 1987).° 
Although people may be well aware that different so- 
cial norms characterize televised interactions, they may 
nonetheless respond to televised depictions of political 
actors based on norms for “real” life. As Mansbridge 
(1983) has noted, when there is open political conflict 
in real life, bringing people together in one another’s 
presence can intensify their anger and aggression. To 
the extent that a television presence has similar effects, 
we would expect greater negativity to result. 

To summarize, when encountering differences of 
opinion in person, the tendency is for people to down- 
play their differences and maintain a polite, cordial 
atmosphere; in contrast, mediated portrayals of po- 
litical conflict emphasize strong differences of opin- 
ion, at least in part to enhance dramatic value and 
attract viewers. Television further intensifies the nega- 
tive, conflictual aspects of the experience by causing 
viewers to experience uncivil exchanges of political 
views from a highly intimate perspective. The cen- 
tral hypothesis in this study is that this violation of 
social norms should cause negative reactions toward 
politicians and government. People expect others to 
obey social norms and evaluate them less favorably 
when they do not. In addition, given that the norma- 
tive expectation is of civility, this influence should be 
primarily a function of negative reactions to incivility, 
rather than positive reactions to civility. Finally, we fur- 
ther hypothesize that television viewers will respond 
negatively to incivility on television due to largely 
gut-level, emotional reactions to violations of social 
norms rather than a cognitive awareness of excessive 
conflict. 


6 Although these studies focused specifically on the effects of poltti- 
clans’ nonverbal facial displays on public attitudes, theur findings 
likewise suggest that expressive displays have a direct emotional 
impact on viewers (see McHugo et al 1985 and Sullivan and Masters 
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STUDY DESIGN 


Experimental methods provide the best means for test- 
ing this hypothesis. First, problems with the validity and 
reliability of self-reported media exposure are well- 
known (Price and Zaller 1993). An expectation that 
people can accurately report levels of exposure specif- 
ically linked to televised incivility is unreasonable and 
beyond most people’s capacities. Moreover, an experi- 
mental design makes it possible to evaluate the impact 
of a subtle variation in the presentation of political 
conflict—the level of civility in the interaction—while 
holding the substantive content of the disagreement 
constant. In addition, as demonstrated by the third ex- 
periment in this study, a laboratory setting made it pos- 
sible to use an indirect means of gauging the intensity of 
subjects’ reactions to a televised interaction. Measur- 
ing viewers’ physiological reactions to civil and uncivil 
political conflict provided insight into the psychologi- 
cal processes underlying the effects we observed. Ex- 
perimental designs also were essential to draw strong 
causal inferences. Those who distrust government may 
be particularly drawn to uncivil television content; if 
so, reverse causation could be problematic, or spurious 
relationships could result from correlates of distrust 
and television watching. 

The central manipulation in our experiments was the 
extent to which politicians exchanged political view- 
points in a manner that violates the typical norms gov- 
erning face-to-face political conflict. In three experi- 
ments using adults and undergraduate subjects,’ we 
exposed viewers to systematically different versions of 
four different political disagreements that were drawn 
from a larger pool. We systematically manipulated the 
extent of civility and politeness in these expressions of 
political difference, without altering the political con- 
tent of the exchanges. We expected that when dissonant 
views were presented in an uncivil fashion, they would 
encourage more negative, distrustful attitudes toward 
politics and politicians. 

To produce stimuli for these studies, professional ac- 
tors were hired to play the roles of two congressional 
candidates. A television studio with a political talk 
show set was used to tape a mock program in which 
the disagreements appeared. While seated around a 
common table on the set, a moderator directed ques- 
tions to the two candidates. The cover story was that 
Bob Lindzey and Neil Scott, two candidates for an open 
congressional seat, were invited to appear on Indiana 
Week In Review in order to familiarize potential voters 
with their positions. 


7 Adult subjects were recruited through temporary employment 
agencies, and they were paid for their partiapation by the agency 
at the hourly rate they had agreed on with the agency. Student 
subjects were recruited from political science courses as part of a 
class opportunity for extra credit. All subjects were mvited to partic- 
ipate “in a study that mvolves watching television.” In Experiment 
1, 75% of the subjects were college students, and the remaining 25% 
were recruited from the community In Experment 2, 45% of the 
subjects were students, and 55% were drawn from the community. 
In Experment 3, all subjects were recruited from the community 
We found no systematic difference ın the reactions of student and 
nonstudent subjects 
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In order to ensure the same political content 
in the civil and uncivil versions of the discussion, 
teleprompters were used to help the actors adhere 
closely to a script. The script drew on arguments from 
interest groups that were for and against the issues. 
Candidates were always assigned to opposing views 
on each of the issues, and they were both directed 
to interact in either a civil or an uncivil fashion. In 
order to avoid subject fatigue, only a subset of four 
of the issue disagreements was used in each experi- 
ment. Experiment 1 included discussion of free trade, 
mental health insurance, Internet privacy regulation, 
and NASA funding. Experiment 2 covered tobacco 
regulation, taxation of retail sales on the Internet, pub- 
lic service experience, and free trade. The issues dis- 
cussed in Experiment 3 included regulation of the to- 
bacco industry, Internet taxes, the repeal of the Glass— 
Steagall Act, and whether public service experience 
is an important qualification for office. Each issue ex- 
change was edited so that it was roughly five minutes 
long. 

‘Two versions of each exchange were taped on the 
talk show set. The candidates expressed exactly the 
same issue positions in the same words in both versions, 
and offered exactly the same arguments in support 
of their positions. But in the civil version the candi- 
dates went to extremes to be polite to the opposi- 
tion, inserting phrases such as “I’m really glad Bob 
raised the issue of...” and “I don’t disagree with all of 
your points, Bob, but...” before calmly making their 
own positions clear. Both candidates fully observed 
the interpersonal norms for civility in expressing their 


FIGURE 1. 


Source’ Experiment 2. 


Note. Compansons are of means on a zero-to-eight scale All crl/unerl com 
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viewpoints, not only in their own speech, but also by 
waiting patiently while the other person answered and 
by paying attention to the opponent while he was 


8- 

In the uncivil version of these exchanges, the can- 
didates used the same script but inserted gratuitous 
asides that suggested a lack of respect for and/or frus- 
tration with the opposition. Sample statements include 
comments such as “Yov’re really missing the point here 
Neil” and “What Bob is completely overlooking is....” 
The candidates also raised their voices and never apol- 
ogized for interrupting one another. Nonverbal cues 
such as rolling of the eyes and rueful shaking of the 
head from side to side were also used to suggest lack 
of respect for what the opponent was saying. Voices 
were raised when conflict intensified, in contrast to the 
persistently calm voices of the candidates in the civil 
version. 

Manipulation checks confirmed that the two versions 
of each issue discussion were perceived as significantly 
different in levels of civility. In Experiment 1, two 
items confirmed that subjects in the uncivil condition 
perceived Neil and Bob to be significantly less po- 
lite than in the civil condition (t = 1.96, p < .05). In 
Experiment 2, subjects were asked to evaluate the 
speakers on an expanded battery of manipula- 
tion check items, including adjective pairs ranging 
from “quarrelsome” to “cooperative,” “friendly” to 
“hostile,” “emotional” to “unemotional,” “calm” to 
“agitated,” and “rude” to “polite.” As shown in 
Figure 1, the candidates in the uncivil versions of the 
issue exchanges were consistently perceived as less 


Manlpulation Checks for ClviV/Unclivil Issue Debates 


OCh E Unctvil 


nonpoftical differences were signif- 


parisons 
cantly different from one another. The ‘values for the compartsons from the top to the bottom of the figure were as follows: t = 7.04, 
p < 001; f= 8 83, p < .001; t = 3.07, p < .01, f= 4.51, p < 001, and t= 5.29, p < .001. Means for ideology and partisanship did not 


differ significantty by condition. 
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polite, more quarrelsome, more emotional, agitated, 
and hostile. Equally important are the two null find- 
ings at the bottom of Figure 1. There were no per- 
ceived ideological differences in a candidate’s stand 
as it appeared in the civil versus the uncivil condition, 
thus affirming our effort to keep the political substance 
espoused by each candidate constant. 


Procedures 


Participants sat alone in a room with a small, comfort- 
able couch and an overstuffed chair, plus a coffee table 
and a 32-inch television. After consent to participate 
was obtained, they filled out a pretest questionnaire. 
Subjects then viewed 20 minutes of the televised politi- 
cal program, incorporating the same scripted disagree- 
ments on four different issues,’ and, afterward, filled 
out posttest questionnaires. Subjects were debriefed 
after all experiments. They were told that their answers 
were being compared to those of others who watched a 
different version of the same program. But subjects in 
Experiment 2 were not told our hypotheses in order to 
facilitate follow-up interviews by phone for those who 
consented. 

Although the experimental procedure was basically 
the same in the two experiments, in Experiment 1 sub- 
jects were randomly assigned to civil or uncivil condi- 
tions (n = 67), while in Experiment 2, we incorporated 
a third, control condition in which subjects did not 
watch any political television (n = 155)."” The control 
group was included in the second experiment to de- 
termine whether differences between the civil and the 
uncivil conditions were due to an elevation of political 
trust in the civil condition, a decrease in trust in the 
uncivil condition, or some combination thereof. The 
participants randomly assigned to the control group 
watched a nonpolitical program for the same amount 
of time that the treatment groups watched the talk 
show!!, and filled out the same questionnaires.” In Ex- 
periment 3, we focused on understanding the process 
of influence that accounts for the pattern of findings 
in the two previous experiments. For these purposes 
we utilized a within-subjects experimental design in 
which all subjects were assigned to both the civil and 


8 Although we refer to the two versions as “civil” and “uncivil,” it 
should be noted that relative to contemporary political talk shows, 
even the “uncivil” version would not be viewed as extremely hostile. 
Relative to exchanges between candidates and members of Congress, 
the portrayals in these experiments were highly realistic and well 
within the range of what people expect of congressional candidates. 
None of our subjects questioned their authentaty. 

9 In the first experment, the order of presentation of the four issue 
discussions was randomized In the second experiment, we used a 
fixed order (free trade, tobacco, Internet taxes, and public service) 
10 This time the list of issues included NASA funding, mental health 
insurance, a federal Internet privacy policy, and free trade. 

11 The nonpolitical program was an instructional video on how to 
improve one’s free throw shooting ability, combined with a brief 
history of the NBA AI tapes are available for viewing upon request. 
12 The issues discussed in Experiment 2 were restrictions on tobacco 
advertising, free trade, taxes on Internet purchases, and whether 
previous public service/political experience is an asset or a liability 
for members of Congress. 
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the uncivil conditions in randomly determined orders 
for the discussion of four different issue disagreements. 


Measuring Political Trust. The best-known measures 
of political trust are those from the National Election 
Studies battery that are labeled under the rubric of 
support for the political system. We utilized these mea- 
sures as dependent variables in our study, though not 
exclusively so. To address all of the various types of 
trust that have been linked to political television, we 
included questions to form three separate political trust 
indexes tapping Trust in Politicians, Trust in Congress, 
and Trust in the Political System. Although it is custom- 
ary to define different kinds of trust by virtue of the 
target being trusted (the political system, politicians 
in power, etc.), we found, consistent with Citrin and 
Muste (1999, 467), that while items used to measure 
political trust may have different target objects, they 
are of “dubious discriminant validity.” For this reason 
we do not claim to have measured empirically distinct 
concepts, nor does our theory make differential predic- 
tions. However, because indicators have traditionally 
been grouped by target object, we did the same in our 
initial analyses, but also provide a pooled analysis that 
combines the three dimensions of political trust.” 

Our theory suggests that people will be less likely 
to trust politicians if they are seen as uncivil in their 
interactions with one another—even though that may 
well be considered the norm for political television. 
When political actors do not behave according to so- 
cial norms, viewers are reminded that politicians do 
not appear to abide by the same rules of the game 
as everyone else. Norm violations should generate a 
negative reaction, much as they do when people are 
exposed to such violations in real life. 


RESULTS 
Experiment 1 


The goal of the first experiment was to assess whether 
the extent of civility in a televised exchange of political 
views influences judgments of political trust. Data for 
Experiment 1 were analyzed by testing the difference 
between mean levels of trust in the two conditions. As 
shown in Figure 2, all three trust measures were signifi- 
cantly influenced in a negative direction by the less civil 
exchange. Trust in Politicians became more negative, 
even though the content of the talk show made it clear 
that neither Bob nor Neil currently was, nor had ever 
been, a member of Congress (F = 10.35, p < .001). 
Likewise, Trust in Congress also became more negative 
after only one 20-minute exposure (F = 6.00, p < .01). 
Finally, attitudes toward the American political system 
also were significantly, though more modestly, influ- 
enced by the civility of discourse (F = 3.12, p < .05). 


D In Experiment 1, the individual scales all achieved respectable 
levels of reliability (see Appendix), but in Expenment 2, with a 
much higher proportion of subjects drawn from the community, they 
did not, unless combined into a single index of political trust. 
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FIGURE 2. Effects of Incivility on Trust In Government and Pollticlans 


OCivil W Uncivil 


Sources. Experiment 1. 


Nota: Differences between crvil and uncrvil conditions were consistently significantly different In the expected direction (F = 10.35, 
p < .01, F = 8.00, p < 01; and F = 3.12, p < 05). Corresponding partial eta-squared values were .14, .08, and 05. 


We have suggested that viewers are experiencing a 
visceral, gut-level negative reaction to violations of so- 
cial norms. But others have proposed what might seem 
a far simpler explanation: people simply dislike conflict, 
and they see it as particularly unnecessary when the 
answers to many political problems seem obvious to 
ordinary people (e.g., Hibbing and Theiss-Morse 2002). 
Perhaps, then, what we observe in Figure 2 is incivility 
in public discourse leading people to lose respect for 
the idea that free and open debate is an integral part 
of our political process. If people often see poor exam- 
ples of political disagreement, then attitudes toward 
conflict and its importance in the democratic system 
would naturally become more negative. 

To evaluate this alternative explanation, we used two 
scales tapping attitudes toward political conflict (see 
Appendix for details); one of these tapped attitudes 
toward congressional debate (including items such as 
“Members of Congress bicker a lot more than they 
need to”), and the second index tapped attitudes to- 
ward the importance of free and open political debate 
(e.g., “It’s very important that politicians air their dif- 
ferences of opinion publicly”). 

The civility of discourse in the experimental manip- 
- ulation did not have any influence on evaluations of 
conflict in either case. People apparently differenti- 
ated between the importance of public conflict—that 
is, the exchange of differing views, which remained 
constant across the two presentations—and the civility 
of that conflict, which varied significantly. Popular dis- 
cussions often conflate the extent to which disagree- 
ment takes place with the civility of those interac- 
tions, thus conflating greater hostility with greater 





conflict. But in this study neither the combined 
scales nor any one of the many individual items tap- 
ping attitudes toward conflict varied by experimental 
condition. 

The fact that viewers did not become more nega- 
tive toward conflict lends support, albeit indirect, to 
our assertion that viewers’ reactions stem from a vis- 
ceral, gut-level negative reaction to incivility, one that 
originates in face-to-face social norms, but that is ap- 
plied without thought to televised politicians as well. In 
order to examine evidence bearing more directly on 
this hypothesis, we made use of a scale from the pretest 
questionnaire that assesses the extent to which individ- 
uals approach or avoid situations involving conflict in 
their everyday face-to-face discourse with others. The 
Conflict Approach/Avoidance Scale was developed to 
predict an individual’s willingness to make interper- 
sonal conflicts explicit. It is known to have high relia- 
bility levels, minimal social desirability bias, and pre- 
dictive validity based on willingness to participate in 
conflict-related interventions, such as mediation. The 
scale is also sensitive to different cultural norms re- 
lated to conflict-related communication (see Goldstein 
1999). The items included in this scale tap the extent 
to which an individual enjoys challenging the opinions 
of others, feels upset after an argument, and so forth 
(see Appendix for details). If our theory is correct, 
this individual difference should exacerbate reactions 
to incivility. But if watching politicians on television 
as purely a third-party observer bears no relation to 
how people react when they themselves are personally 
involved in disagreements, then reactions to televised 
political incivility should not be contingent on what 
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FIGURE 3. Size of Effect on Trust (Civl-Unclvil Presentation) by Level of Conflict Avoldance 
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Source. Experiment 1. 


Note. Pattern of results from analysis of variance Including Cmllity (2) and Conflict Avoidance (3). Results demonstrate a significant 
main effect for treatment (F = 10.37, p < 01, partial eta-squared = 15), as well as a significant interaction between Conflict Avoidance 


and Civility (F = 5.81, p < .01, parbal eta-squared = .16) 


makes them uncomfortable in interpersonal interac- 
tions. 

In order to see if responses to civility and incivil- 
ity are different for those with varying propensities 
for conflict avoidance, we divided the conflict avoid- 
ance scale into equal thirds of low, medium, and high 
levels of conflict avoidance and then used an anal- 
ysis of variance model including the main effects of 
civility/incivility and of conflict avoidance, plus the in- 
teraction between conflict avoidance and civility. We 
hypothesized that if the effects shown m Figure 2 were 
truly based on a gut-level aversion to incivility in dis- 
agreements, then those most conflict-averse in face- 
to-face contexts should react most strongly to viewing 
uncivil discourse as well. For purposes of this analysis, 
we combine the three measures of political trust, given 
that they are highly intercorrelated and generate iden- 
tical patterns of results. Moreover, the reliability of the 
general index of political trust is higher than any of the 
individual indexes (a = .84). 

Figure 3 shows the differences in levels of political 
trust between the civil and the uncivil conditions, bro- 
ken down by low, medium, and high levels of conflict 
avoidance. As suggested by this pattern, the analysis 
of variance model generated not only the sig- 
nificant main effect for civility that was already 
observed (F = 10.37, p < .01), but also a highly sig- 
nificant interaction between conflict avoidance and in- 
civility (F = 5.81, p < .01). Moreover, the partial eta- 
square values indicate that the effect of the interaction 





was, if anything, larger than the impact of incivility 
alone. 

As shown in Figure 3, the difference between levels 
of political trust in the civil versus the uncivil condi- 
tion was a function of individual differences in conflict 
avoidance. For those who are generally uncomfortable 
with face-to-face disagreement, the uncivil condition 
generated much lower levels of political trust than 
the uncivil condition. For those with moderate levels 
of discomfort with conflict, we observed a somewhat 
smaller differential. And for those who find disagree- 
ments somewhat enjoyable, the pattern was reversed in 
that the uncivil condition generated slightly higher lev- 
els of political trust than the uncivil one. This pattern of 
results lends credence to our interpretation of findings 
based on the impact of personally “experiencing” dis- 
agreement via television viewing. Even though viewers 
are not personally involved in the political disagree- 
ments they view, they nonetheless react in the same 
fashion when televised disagreements violate face-to- 
face norms. 


Experiment 2 


Our second experiment was designed to replicate and 
extend the findings from the previous experiment. In 
this study we recruited a much larger number of par- 
ticipants (n = 155), with many adult subjects recruited 
from the community, and we extended the design to 
include a control condition that viewed neither version 
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of the political disagreements. We also used a different 
set of issues in the talk show to ensure that our results 
did not hinge on any one particular stimulus. Finally, 
by including pretest assessments of the two candidates 
based on their pictures alone, we also strengthened 
our ability to claim that observed differences are due 
to the candidates’ behavior during the televised pro- 
gram rather than simply effects of being exposed to 
pictures of the two politicians. An analysis of variance 
confirmed no significant differences among the three 
conditions in pretest evaluations of the candidates. 

In analyzing Experiment 2, our initial focus was 
on determining whether the findings in Experiment 1 
could be replicated using different stimuli to represent 
civil and uncivil discourse. Table 1 shows the means 
for the three separate indexes of political trust, plus 
the combined index, broken down by civil and un- 
civil conditions. Despite the fact that the issues dis- 
cussed in these stimuli were different from those in 
Experiment 1, the effects of incivility on political trust 
replicated across all three indexes. The results are vir- 
tually identical to those found in Experiment 1. The 
civil presentation resulted in significantly higher lev- 
els of political trust than the uncivil exchange of the 
same political views. This pattern extended to attitudes 
toward Congress, politicians in general, and the en- 
tire U.S. system of government (F = 2.61 and p < .05, 
F =3.84 and p < .05, and F =3.13 and p < .05, for 
Congress, politicians, and government, respectively). 

In order to determine whether civility was increasing 
levels of political trust and/or incivility was decreasing 
levels of trust, we compared the experimental means 
to the control conditions using simple contrasts. Al- 
though the general pattern of results for two of the 
three trust measures in Table 1 is suggestive of more 
positive assessments from civil presentations and more 
negative ones from uncivil ones, comparisons with the 
control group means showed that the difference from 
the control group was significant only in the uncivil 
condition. Arguably, this null result could be due to the 
substantially lower reliability of the indexes obtained in 
Experiment 2 (see Appendix), which probably resulted 


TABLE 1. Comparison of Civil, Uncivil, and 
Control Means 
Unelwl Control Chil F 
Condition Condition Condition value 
Trust In —.15 3 86" 
Polticlans (58) (155) 
Trust In —.09 2.71* 
Congress (58) (155) 
Trust In System —.17° 2.96* 
(155) 
Polrtical Trust — 25° : ; 5.67* 
Index 58 155 
Source. Expenment 2. 
Note: Cell entnes are means, with sample sizes in parentheses. 
AJI uncril and cvil means are significantly different from one 
another at the p < .05 level. 
CConditions that are significantly different from the control con- 
diton for that variable at the p < .05 level. 


of Government (58) 





from the more diverse sample of subjects in this experi- 
ment. To further confirm the control group comparison, 
we improved the reliability of the dependent variable 
by combining the three highly intercorrelated sets of 
items into a single index of Political ‘Trust, as was done 
in Experiment 1. This strategy succeeded in increasing 
the reliability to acceptable levels (a = .75). But even 
with the more reliable index, there still is no significant 
difference between the control condition and the civil 
presentation, and there remains a significant difference 
between the control and the uncivil conditions (index, 
F = 3.05, p < .05; planned comparisons between civil 
and control, p > .60, and between uncivil and control, 
p < .05). 

This pattern is consistent with the theory offered as 
to why people react negatively to incivility. Although 
cognitively they may avow that the norm on television 
is incivility, their expectations are based on the world 
of interpersonal interaction, where civility is what is 
expected. Incivility creates important deviations from 
people’s default assumptions. Thus the civil condition 
does not differ from the control condition, while the un- 
civil condition does. This pattern supports the assertion 
that differences are caused by television’s departure 
from interpersonal social norms. Viewing a civil in- 
teraction leaves political trust unchanged, but viewing 
an uncivil interaction—even for a mere 20 minutes— 
significantly lowers levels of political trust over not 
viewing at all. 

Based on this evidence, can we be confident that 
viewers are reacting for the specific reasons we have 
posited? Does viewing an uncivil political disagree- 
ment produce more negative feelings because it vio- 
lates norms for face-to-face discourse? Thus far three 
findings bolster support for this interpretation. First, 
this effect is most pronounced among those who try to 
avoid interpersonal conflict in their everyday lives— 
regardless of whether it is political or not. Second, this 
pattern is clearly not a function of the impact the civility 
manipulation has on attitudes toward the importance 
of conflict per se; indeed we find no effects on atti- 
tudes toward the desirability of political conflict, de- 
spite our best efforts. Even immediately after viewing 
a far-from-exemplary demonstration of the exchange 
of political differences, viewers are no more likely to 
see political conflict as petty, unnecessary bickering. 
Finally, we also find that it is incivility in particular, the 
counter-normative behavior that departs from face-to- 
face expectations, that accounts for most of the neg- 
ative effects observed. People expect political actors 
to act in a predictable manner, an expectation based 
on the world of face-to-face interaction, where civility 
is the norm. When politicians do not act according to 
these expectations, they create negative reactions in 
viewers. 

But is exposure to disagreement coupled with in- 
civility truly producing the visceral, gut-level reaction 
that we have hypothesized? A far simpler cognitive 
explanation is that incivility simply lowers the esteem 
with which viewers regard candidates. Viewers think 
less of those who behave in an uncivil fashion, and those 
candidates, as representatives of the larger category of 
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political actors, lower the cumulative impression that 
viewers have of politics and politicians. The evidence 
from these experiments is not consistent with this incre- 
mental explanation because overall attitudes toward 
the candidates (as measured by combined thermome- 
ter ratings for the two candidates) were not systemat- 
ically affected by incivility. In Experiment 1, attitudes 
toward one candidate were higher in the civil condition, 
while attitudes toward the other were unaffected. In 
Experiment 2, thermometer ratings were no higher in 
the civil than in the uncivil condition. If citizens were 
merely rationally updating their views about politicians 
on the basis of these two men appearing less likable on 
the whole in the uncivil condition, then this is not the 
pattern of findings one would expect. 


Experiment 3 


To more directly test the idea that these reactions are 
rooted in emotional, gut-level responses to viewing 
political incivility, we ran a third experiment tapping 
viewers’ physiological reactions to the same programs. 
Using small electrodes attached to viewers’ hands 
while they watched the programs, we recorded levels 
of electrodermal activity, known as skin conductance, 
throughout the viewing. Skin conductance measures 
are a widely accepted measure of physiological arousal 
and emotional response (see Dawson, Schell, and 
Filion 2000) and have been used extensively in stud- 
ies involving arousal responses to media (e.g, Lang 
2000 and Reeves et al. 1999). Exposure to conflict in 
face-to-face settings causes increased arousal, and in- 
civility creates even higher levels, as if the human body 


were preparing for a response. If our theory is correct, 
simply viewing others arguing in an uncivil fashion 
should cause this same physiological reaction, even 
though it makes no apparent sense to prepare for ac- 
tion in response to mere pictures. The uncivil political 
conflict portrayed on television obviously cannot burst 
into our living rooms to threaten us, yet our brains may 
respond as if this were exactly what might happen. 

Because there are often large individual differences 
in skin conductance levels, we used a within-subject 
design for Experiment 3 so that people watching un- 
civil exchanges could each be compared to themselves 
watching civil disagreements. A within-subject design 
also allowed us to obtain findings from a relatively 
small sample of 16 subjects. In Experiment 3 each sub- 
ject viewed two issue debates in a civil version and two 
in an uncivil version. The order of issues and assign- 
ment of a given issue to the civil or uncivil condition 
for each subject were completely randomized in a Latin 
Square design to cancel out potential issue and/or order 
effects. Three hundred data points for skin conductance 
level were collected across the five-minute period. We 
then analyzed the effects of the civility manipulation 
on these data while controlling for time as a repeated- 
measures factor. 

Figure 4 illustrates average levels of skin conduc- 
tance over the five minutes of issue discussion. As with 
reactions to other new stimuli, skin conductance levels 
are highest at the onset of a stimulus presentation and 
then decline monotonically. Time was a highly signifi- 
cant factor in the analysis given that more time consis- 
tently produces lower levels of arousal with any form of 
media stimuli. But when people viewed issue debates 


FIGURE 4. Physiological Arousal by Civili and Uncivil Conditions 
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Source. Expenment 3. 


Note. Points represent averages based on cvil and uncivil means Inemilrty produced significantly higher levels of physiological arousal 


(t= 14.38, p < .001). 
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FIGURE 5. Attttudes Toward the Polltical Program, by Levels of Civility 
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Note. Both vanables are measured on one to five-point scales. The means for Percerved Informatveness do not differ significantty 
(F = .01, p > 05). The means for Perceived Entertainment Value are significantty different in the cml and uncmwl conditlons (F = 14.79, 


p < .001, partial eta-equared = .11). 


in their uncivil versions, the same exchanges caused 
significantly higher levels of arousal, just as our theory 
predicted (t = 14.38, p < .001). Over the entire series, 
issue presentations shown in the uncivil version were 
significantly more emotionally arousing than those in 
the civil version. Because the subjects in Experiment 3 
were exposed to both civil and uncivil discourse, we 
did not predict effects on levels of political trust in 
Experiment 3. Overall, any such effects should have 
cancelled one another out by the time of the posttest. 

Given that politics is not a highly arousing topic for 
most citizens, and that this program was of the rel- 
atively unexciting, yet common, “talking heads” va- 
riety, it is noteworthy that incivility applied to stan- 
dard political issues can produce primitive reactions 
in those who are merely viewing others engaged in 
disagreement. Television viewers are notoriously pas- 
sive, “third-party” spectators when it comes to politics. 
And yet, at a very basic physical level, they are react- 
ing to incivility on the screen as if it were very real 
indeed. 


Popularity of the Program 


The finding shown in Figure 4 begs the additional ques- 
tion of whether arousal is tapping characteristics of pro- 
grams that also make them popular with viewers. Mea- 
sures available in Experiment 2 allowed us to examine 
this hypothesis. One battery of five items addressed the 





Perceived Entertainment Value of the program, and 
another battery of five items tapped its Perceived Infor- 
mativeness (see Appendix). As shown in Figure 5, the 
civil and uncivil versions of these discussions were per- 
ceived as equally informative, and rightly so given that 
they included precisely the same amount of substan- 
tive political information. But overwhelmingly, viewers 
found the uncivil version of this public affairs talk show 
more entertaining, indicating that it was more interest- 
ing, More exciting, and so forth (F = 15.61, p < .001, 
for main effect of incivility). They also indicated a much 
weaker to desire to view the program again in the fu- 
ture if they viewed the civil version. Unfortunately, the 
kind of presentation of political conflict that is likely 
to attract audiences and build television revenues does 
not appear to be the one that best serves democratic 
citizens. 


DISCUSSION 


Experiments 1 and 2 consistently demonstrate that in- 
civility in televised political discourse has adverse ef- 
fects on political trust relative to civil discussions of the 
same political substance. The effects we have isolated 
are surprising, in part because of the brief nature of the 
stimulus—a 20-minute television program—and in part 
because political trust is typically regarded as a more 
stable attribute of individuals, one that changes slowly 
and incrementally, if at all, or perhaps suddenly in the 
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case of startling news events such as the Watergate 
scandal in the 1970s. Our studies suggest a surprisingly 
high level of malleability in political trust. If even a 
brief exposure to political debate can produce these 
systematic changes, it is clearly more volatile than pre- 
viously thought. 

This observation naturally leads to the question of 
whether these laboratory-induced effects persist over 
time. Experiment 2 provided an opportunity to ex- 
amine this question. Our experimental subjects were 
asked at the time of their participation if they would 
consent to a follow-up phone call at home after the 
experiment itself, and just over 60% of the subjects 
agreed and provided phone numbers where they could 
be reached. The Center for Survey Research at Ohio 
State University conducted the follow-up interviews, 
which included the same battery of trust questions 
as in the posttest questionnaire administered in the 
laboratory.’ Given the malleability observed in the 
laboratory, we anticipated that the effects of the ma- 
nipulations should have dissipated by the time of the 
phone interviews (see Druckman and Nelson 2003). 
This pattern was expected because subsequent expo- 
sure to political discourse should swamp the influence 
of any short stimulus. 

The findings from this analysis basically confirmed 
our hypothesis.» The uncivil group “bounced back” 
to the level of trust in the posttest control group, that 
is, back to the level of trust it had before incivility in 
the laboratory temporarily depressed levels of politi- 
cal trust in this group. The civil group, which tended 
to be slightly, though not significantly, more trusting 
than the control group in the initial posttest, lowered 
its levels of trust somewhat in the follow-up interview 
relative to movement in the control group. By the time 
of the follow-up interview, there were no significant 
differences by original experimental condition. 

These findings suggest that the effects we observed in 
the lab may be transient and short-lived. But given that 
our brief manipulation was designed to simulate the 
persistent, often rancorous tenor of televised political 
interactions, we may already be witnessing substantial 
ongoing effects based on television’s tendency toward 
dramatic conflict. Indeed, if one envisions the cumu- 
lative impact of repeated exposure to incivility, then 
low levels of political trust are not surprising. On the 
other hand, to the extent that civil political discourse 


14 Interviews were attempted roughly three weeks after the exper- 
ment and were completed, on average, one month after participation 
occurred Although attrition in sample size between the laboratory 
results and the follow-up interview reduced the strength of our orig- 
inal laboratory findings, the posttest means for the subsample that 
was observable at both times maintained the same initial pattern of 
significant differences among civil, uncivil, and control conditions 
(F = 2 61, p < .05). 

15 To analyze these data, we used a two factor mixed-model analysis 
of variance with one repeated measure, within-subjects factor (levels 
of political trust immediately after the expermment and one month 
later), and one between-subjects factor (experimental condition), 
plus the mteraction between the two. Although the interaction be- 
tween experimental condition and time approached statistical signif- 
icance, this finding occurred because of differential change over time 
by expermental condition. 


12 


February 2005 


is equally prevalent, one would expect these effects to 
cancel themselves out. 

The generalizability of these findings outside of the 
laboratory, outside of the collection of stimuli used 
in this experiment, and outside of these subjects also 
warrants consideration. With respect to experimental 
subjects, we are on stronger-than-usual footing because 
we did not rely exclusively on college student subjects. 
In addition, our subjects were drawn from the larger 
community in a medium-sized metropolitan area. They 
incorporated a range of ages, educational levels, and 
socioeconomic backgrounds, and we found no differen- 
tial effects between subjects obtained through different 
means of recruitment. 

We used a political talk show as a realistic pretext 
for these stimuli, but not out of a desire to draw con- 
clusions about the effects of talk shows as much as to 
evaluate the consequences of uncivil discourse more 
generally. Uncivil discourse can and has appeared on 
regular news programs and in political debates, as well 
as on talk shows of various kinds. The level of incivility 
demonstrated by the political actors in our program 
was relatively mild compared to the hostility exhib- 
ited in many of today’s political talk shows, and this 
was done purposely in an effort to extend the gen- 
eralizability of the findings. Those engaged in uncivil 
repartee are typically not the candidates themselves 
but, rather, representatives from opposing campaigns 
or issue camps. These differences could have height- 
ened the sense of norm violation viewers experienced 
while watching these programs. But the requirement 
that viewers’ perceptions of the candidates’ political 
positions be identical across civil and uncivil conditions 
meant that we were limited in the extent of incivility 
that would make sense in this context. 

On the one hand, because of the laboratory setting 
our subjects undoubtedly paid closer attention to the 
television program than they might have in a more nat- 
uralistic context. Greater attention is likely to create 
greater potential for emotional reactions of the kind 
we have observed. But the types of issues discussed 
in these programs may have counterbalanced any such 
increase. None of the issues discussed were particularly 
emotion-laden issues relative to other possible topics. 
In general, one would expect issues such as same-sex 
marriage or flag burning to prompt much stronger emo- 
tional reactions. Nonetheless, regardless of which sub- 
set of issue disagreements we used, our findings were 
consistent. All of the issues used in the program had 
been in the news at the time of the taping of the mock 
talk show, and they remained topical for the duration 
of the experiments. Our pretest questionnaires showed 
that most subjects had opinions to report on most of the 
issues that were discussed. On a few topics, such as free 
trade, subjects had fairly strong views, but on the whole 
these were not the sort of issues likely to elicit strong 
reactions. If the discussion had involved so-called “hot 
button” issues, we would expect much stronger reac- 
tions. Likewise, should the discussion have occurred 
among politicians toward whom our respondents 
already had strong feelings, their reactions might have 
been stronger. 
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Our studies also add to the burgeoning literature 
on emotion and politics, and point to fruitful avenues 
for future research in this vein. Although advances in 
research on the role of emotion in politics have been 
notable in recent years (e.g., Marcus, Neuman, and 
MacKuen 2000), most use traditional survey methods 
that ask people to think about and label feelings. While 
useful in some respects, these self-reports have obvi- 
ous limitations in that they require people to engage 
in cognitive processing in order to label their emo- 
tions. Ultimately, less reflective methods such as those 
demonstrated here are also needed in order to advance 
knowledge in this area. 


CONCLUSION 


Political disagreement is inevitable and unavoidable, 
and also quite desirable from the perspective of most 
democratic theory. In this study we examined the 
hypothesis that it is the manner in which such dis- 
agreement is presented that discourages positive at- 
titudes toward politics and politicians. The results of 
these experiments show that uncivil political discourse 
has detrimental effects on political trust. Not only 
were attitudes toward politicians and Congress af- 
fected, but levels of support for the institutions of 
government themselves also were influenced. Impor- 
tantly, these effects occurred even though the extent 
of substantive disagreement/political conflict was held 
constant. 

Whether those effects are specific to televised expo- 
sure to incivility is a complex question. For example, 
a print version of this same program—even a direct 
transcript—would not be perceived as demonstrating 
the same extent of incivility as the television program, 
in which viewers witnessed nonverbal and paralinguis- 
tic cues such as Bob sneering at Neil and Neil shaking 
his head ruefully while Bob was speaking. Two newspa- 
per articles could purposely be written to come across 
as just as civil and uncivil to their readers as the civil 
and uncivil talk show was to its viewers. But then the 
newspaper articles’ substantive content and perspec- 
tive would have to be quite different from that of the 
television version. Ultimately substantive differences 
between media would make it difficult to argue that 
any observed effects of incivility are uniquely due to 
televised incivility, as opposed to differences in the con- 
tent of the messages themselves. 

Although a “critical test” of this theory may not be 
possible, we have nonetheless taken an important step 
forward in answering an overlooked, yet very basic 
question: “How do images of leaders, as seen on tele- 
vision, influence the public’s attitudes and beliefs?” 
(Sullivan and Masters 1987, 881). Our theoretical 
framework shows that television is likely to exacerbate 
effects of incivility, even if this effect is not necessar- 
ily Aynited to televised incivility. It seems doubtful, for 
example, that reading about a heated political contro- 
versy in the newspaper would cause the same extent 
of heightened physiological arousal that watching that 
same conflict on television causes (see Funk 2001). De- 


spite the fact that most viewers might acknowledge, at 
a purely cognitive level, that incivility is the norm for 
televised political debate, viewers react as if norms are 
being violated. 

Scholars may disagree about the proper interpre- 
tation of the target of distrust in these well-known 
questions, but low levels of political trust are widely 
believed to have consequences for American political 
institutions. Extremely low levels of trust may threaten 
the stability of political institutions, but there are more 
immediate consequences as well. Political institutions 
are able to function more smoothly with high levels 
of trust. Low levels of trust contribute to “a political 
environment in which it is more difficult for leaders to 
succeed” (Hetherington 1998, 791). 

So why does this detrimental state of affairs persist? 
Bill O’Reilly (2001), host of the highly contentious po- 
litical talk show, The O’Reilly Factor, suggests that inci- 
vility is essential to the success of political programs: “Tf 
a producer can find someone who eggs on conservative 
listeners to spout off and prods liberals into shouting 
back, he’s got a hit show. The best host is the guy or 
gal who can get the most listeners extremely annoyed 
over and over and over again.” By behaving outside 
the boundaries of civil discourse, O’Reilly goads his 
viewers into reacting strongly to what might otherwise 
be dull exchanges of political perspectives. 

For most people, politics on its own merits is not 
sufficiently exciting to compete with American Idol or 
E.R. for television audiences, so it requires the drama 
and tension of uncivil human conflict to make it more 
interesting to watch. Although some defenders of pub- 
lic virtue argue that the American public is chomping 
at the bit for serious, high-quality political television, 
the ratings that programs receive suggest that this is 
not the case. 

There is an obvious paradox embedded in these find- 
ings. On the one hand, viewers respond negatively to 
incivility in the judgments they make about politicians 
and government. On the other hand, they are clearly 
drawn to incivility, and enjoy watching it much more 
than civil programming. Funk (2001) equates this phe- 
nomenon to “rubbernecking,” the traffic delays caused 
by people who slow down to look at accidents or other, 
predominantly negative, events. How do we explain 
why people are simultaneously attracted to and re- 
pulsed by political conflict? At a very basic psycholog- 
ical level, aggressiveness demands attention (Bradley 
2000). And yet, in a culture where political disagree- 
ments are ideally not resolved by duels, what will pique 
viewers’ interest in political debate enough to get them 
to pay attention? 

In one sense these results can be viewed as a classic 
case of market failure; in other words, it is a situation in 
which market forces do not best serve the interests of 
democracy. But the solution to this predicament is not 
as simple as blaming the commercial structure of Amer- 
ican television. Even if all political programs looked 
like the most sober and civil political programming, 
they would do little good if no one watched. And thus 
we are left with the quandary of how to create political 
programming that is both interesting and exciting to 
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watch yet not likely to damage public attitudes in a 
significant way. 


APPENDIX: CONSTRUCTION AND 
RELIABILITY OF INDICATORS 


Note: Unless otherwise noted, all responses were on a five- 
point scale, from strongly agree to strongly disagree. 


Trust In Pollticlans (Expt 1, æ = .79; Expt 2, 
a = .53) 


(1) Politicians generally have good intentions, (2) Politicians 
in the U.S. do not deserve much respect. (3) When politicians 
make statements to the American people on television or in 
the newspapers, they are usually telling the truth. (4) Most 
politicians can be trusted to do what is right. (5) Despite what 
some people say, most politicians try to keep their campaign 
promises. (6) Most politicians do a lot of talking but they do 
little to solve the really important issues facing the country. 
(7) Most politicians are dedicated people and we should be 
grateful to them for the work they do. 


Trust In Congress (Expt 1, æ = .74; Expt 2, 
a = .29) 


(1) As far as the people running Congress are concerned, 
would you say you have a great deal of confidence, only 
some confidence, or hardly any confidence at all ın them? 
(2) How much of the time do you think you can trust members 
of the U.S. Congress to do what is right? Just about always, 
most of the time, or only some of the time? (3) Again, using 
this same feeling thermometer, how do you feel about the 
U.S. Congress? 


Trust In the Political System (Expt 1, « = .66; 
Expt 2, æ = .63) 


a At present I feel very critical of our political system. 
2) Whatever its faults may be, the American form of gov- 
ernment is still the best for us. (3) There is not much about our 
form of government to be proud of. (4) It may be necessary 
to make some major changes in our form of government in 
order to solve the problems facing our country. (5) I would 
rather live under our form of government that any other I 
can think of. 


Political Trust Index (Expt 1, æ = .84; Expt 2, 
a = .75) 


This index was a combination of items used in three sub- 
indexes. 


a. of Open Political Debate (Expt 1, 
a=./2 


(1) It’s very important that politicians air their differences 
of opinion publicly. (2) You can’t have a democracy without 
political opposition. (3) You really can’t be sure whether 
an opinion is correct or not unless people are free to argue 
against it. (4) Unless many points of view are presented, there 
is little chance that the truth can ever be known. 
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a of Congressional Debate (Expt 1, 
a = .61 


(1) Bickering among members of Congress does not help to 
solve our nation’s problems. (2) Members of Congress bicker 
a lot more than they need to. 


Conflict Avoldance Scale (from Goldstein 
1999) (Expt 1ae= 79) 


(1) I hate arguments. (2) I find conflicts exciting. (3) I en- 
joy challenging the opinions of others. (4) Arguments don’t 
bother me. (5) I feel upset after an argument. This 18 a 35- 
point scale divided into thirds representing low, medium, and 
high levels of conflict avoidance. 


Perceived Entertainment Value of Political 
Program (Expt 2, æ = .88) 


(1) In general, I found the program to be entertaining for 
a political talk show. (2) This program was sometimes very 
lively. (3) This program was dull and boring even by the 
standards of political talk shows. (4) The pace of the show 
was too slow. (5) Compared to other political talk shows, this 
one was better at keeping my attention. 


Perceived Informativeness of Political 
Program (Expt 2, a = .83) 


(1) In general, I found the program to be informative. 
(2) I learned new things about public issues from this pro- 
gram (3) This program gave me food for thought. (4) If 
I needed information about an upcoming election, I would 
watch this program. (5) I felt like I got to know the candidates 
by watching this program. (6) As a result of watching this 
program, I’d be more comfortable talking to frends about 
this race or about these issues. 


Manipulation Checks for Percelved Levels 
of Civility and Ideology 


How would you describe [Bob Lindzey/Neil Scott]? Using 
the word pairs below, please circle the dot that best de- 
scribes him. Word parr anchors on nine-point scales included 
liberal-conservative, emotional—unemotional, quarrelsome- 
cooperative, friendly—hostile, rude—polite, calm—agitated, 
and Democrat—Republican. Measures for Neil and Bob were 
combined for purposes of the manipulation check. 
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Who Surfs, Who Manipulates? The Determinants of Opportunistic 


Election Timing and Electorally Motivated Economic Intervention 
MARK ANDREAS KAYSER University of Rochester and University of Oxford 


optimal stopping framework to predict the degree of surfing (opportunistic timing) and manipulation 

(politically motivated economic intervention) under alternate institutional structures. Among other 
results, I find that the likelihood of opportunistic elections rises with exogenous economic performance, 
with longer maximum term lengths, with future electoral uncertainty, and with economic volatility but 
diminishes in the value of office-holding; manipulation increases with the maximum term length and 
with the value of office-holding but decreases with exogenous economic performance and with economic 
volatility. The model suggests that single-party governments should be highly opportunistic in calling 
elections and that countries that allow opportunistic election timing should experience less economically 


[: this paper, I develop a career concerns model of government policy choice within a dynamic 


distortionary political intervention than their fixed-timing counterparts. 


A wise person does at once, what a fool 
does at last. Both do the same thing; only 
at different times. 


—Lord Acton 


dates are not fixed. Governments, especially those 

in parliamentary democracies, often can and do call 
elections before the mandatory end of their term, with 
consequences for domestic politics and economics. A 
well-timed election, such as one that coincides with 
an expanding economy, can boost an incumbent gov- 
ernment’s reelection prospects as well as diminish the 
political demand for politically motivated macroeco- 
nomic intervention. Yet despite the importance and 
prevalence of endogenous, politically determined elec- 
tion timing, we know surprisingly little about the cir- 
cumstances and institutions that yield opportunistic 
early elections. When should elections optimally be 
called? What institutional arrangements favor or in- 
hibit opportunistic elections and economic manipula- 
tion? To what degree, if any, does the possibility of 
Opportunistic timing temper the incentives for stimu- 
lating the economy before elections? 

Scholars and the public alike have long believed that 
governments, where able, time elections (“surf”) and 
manipulate their economies (“manipulate”) for polit- 
ical advantage.’ Political histories and press accounts 
abound with depictions of parliamentary governments 


[: the majority of the world’s democracies election 
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This usage of the term “surfing,” to the best of my knowledge, was 
coined by Takashi Inoguchi (1979) 


consumed with the timing of elections (e.g., Butler and 
Kavanagh, multiple years, and Crossman 1979); sta- 
tistically grounded research has found governments 
to be more (Palmer and Whitten 2000; Roper and 
Andrews 2002) or less (Smith 2003) opportunistic. 
Interest in economic manipulation has been no less 
pervasive. Belief in the political business cycle is suf- 
ficiently widespread that one scholar (Suzuki 1992) 
has found that the public systematically raises welfare 
expectations prior to elections; academics match this 
interest with a steady flow of political business cycle 
models—initially with adaptive expectations, later with 
rational expectations—and empirical tests (cf. Alesina, 
Roubini, and Cohen 1997 and Drazen 2000, chap. 7). 
It is incontrovertible that the public and many aca- 
demics believe that governments gain electoral advan- 
tage from both activities, surfing and manipulation. Yet 
despite the obvious possibilities for surfing and manip- 
ulation to influence each other, ever since Nordhaus 
(1975) first formalized the political business cycle and 
assumed fixed election timing, researchers have ana- 
lyzed them in isolation from each other. 

Beginning with Ito and Park’s (1988) seminal pa- 
per, a wave of studies began to bridge this separa- 
tion by showing preelection economic manipulation 
to be considerably muted in several countries that 
permit endogenous election calling (e.g., Cargill and 
Hutchinson 1991, Chowdhury 1993, and Ito 1990).? 
This predominantly empirical literature offered a new, 
but not the only, explanation for the puzzling ab- 
sence of political business cycles around the world 
(c£ Alesina, Roubini, and Cohen 1997). Why should 
governments engage in distortionary and unreliable 
macroeconomic manipulation when the simpler and 
nondistortionary act of election calling can achieve 
the same electoral end? Other observers of the sur- 
prising absence of political business cycles focused 
on other explanations such as the difficulty of trig- 
gering macroeconomic booms prior to elections (e.g., 


2 One notable exception to the empirical onentation of this research 
area was Chappell and Peel's (1979) early model of the polrtical busi- 
ness cycle with endogenous elections. This model, however, provides 
no uncertainty m the election calling decision 
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Lewis-Beck [1988] 1990) or the rational expectations 
of voters (e.g., Cukierman and Meltzer 1986, Person 
and Tabellini 1990, Rogoff 1990, and Rogoff and 
Sibert 1988).°* Researchers, left with a strong incen- 
tive for governments to manipulate the economy— 
voters, after all, punish governments for poor eco- 
nomic performance—but scant evidence of cycles in 
macroeconomic aggregates, turned their attention to 
a finer-grained instrument: cycles in fiscal balances. 
The intuition is simple: It is easier for governments 
to manipulate fiscal components to target electorally 
important constituencies and signal competence to ra- 
tional voters (Rogoff 1990) than to engineer timely 
improvements in output or other macroeconomic ag- 
pregates. Yet again, however, empirics raised a new 
puzzle: “Political budget cycles” are rarely found in de- 
veloped countries (Alesina, Cohen, and Roubini 1992), 
emerging instead primarily in the developing world. 
This may be attributable to weaker institutions and 
stronger governments (Schuknecht 1996) or to lower 
audience costs of manipulation (Gonzalez 1999) in less 
developed countries. I argue that it is also attributable 
to the preponderance of endogenous election timing 
in developed countries, specifically the OECD. Of the 
24 OECD member countries in 1990, only Norway, 
Switzerland, and the United States fully preclude early 
elections.” 

Opportunistic election timing matters for explaining 
the absence of political-economic cycles, regardless of 
whether they are in fiscal balances or macroeconomic 
aggregates. Surprisingly, however, after the initial burst 
of interest in the moderating influence of surfing on 
manipulation, the field stalled. A combination of di- 
minishing marginal returns from demonstrating the 
same effect in additional countries and the absence 
of testable theories of how institutions affect surfing 
and manipulation dampened interest. I seek here to 
redress this problem by developing an explicit model 
with empirically testable predictions. To be precise, I 
embed a career concerns model of government pol- 
icy choice within a dynamic optimal stopping frame- 
work to predict the degree of surfing and manipula- 
tion under alternate institutional, governmental, and 
constituency features. Among other results, I find that 
the likelihood of early elections rises with the variance 
of exogenous shocks to voters’ welfare,’ with longer 


3 Indeed, macroeconomic manipulation not only may be meffective 

but also may backfire as rational voters, behaving as “fiscal conser- 

vatives,” recognize politically engineered election year booms and 

punish incumbents for expected postelection economic distortions 
ltzman 1992). 

Clark and Hallerberg (2000) offer an additional explanation: Given 
capital mobility, fixed and flemble exchange rate regimes constrain 
monetary policy and fiscal policy, respectively. 

5 However, the constramts on opportunistic slection calling vary 
considerably among endogenous timimg countries 

6 With the notable exception of articles by Heckelman and Berument 
eae and Reid (1998). 

As I explain later, voters ın this framework cannot directly observe 
government competence; they infer rt from their welfare. Consistent 
with empirical literature, they do not distinguish between welfare 
shocks that are beyond the control of their elected representatives 
and those that are more credibly attnbutable to government 
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constitutional interelection periods (CIEP), and with 
uncertainty about the future but diminishes with the 
value of office-holding; manipulation decreases in the 
variance of welfare shocks but increases with the CLEP 
and with the value of office-holding; and as surfing is 
increasing in welfare shocks but manipulation decreas- 
ing, surfing effectively substitutes for manipulation. 

These results are strongly prescriptive for institu- 
tional engineering and suggest possible unintended 
consequences for certain political and constitutional 
arrangements. Consider, for example, how increasing 
international economic integration might have notably 
different effects on politics in endogenous timing and 
fixed-timing countries. As both monetary and fiscal in- 
tervention in the economy become increasingly con- 
strained by economic and political integration, govern- 
ments able to time their elections strategically should 
enjoy an electoral advantage relative to their fixed- 
timing peers. Those governments with the fewest insti- 
tutional and political barriers to opportunistic election 
timing, the smallest, most volatile economies, and the 
longest CIEPs should enjoy the greatest advantage 
in their reelection campaigns. Where manipulation 
is least constrained and economic performance most 
volatile—primarily in the less economically integrated 
developing world—it is the electorate that may gain the 
most from opportunistic election timing. Those states 
that least impede the strategic timing of elections, pri- 
marily parliamentary democracies with long maximum 
terms, should experience less preelection manipulation 
and concomitantly less distortion in the postelection 
economy. 

The second goal of this paper is to improve the un- 
derstanding of opportunistic election timing itself by 
replicating for majority governments what Diermeier 
and Stevenson’s (2000) Lupia and Strom’s (1995) coali- 
tion bargaining models have done for the study of 
government termination under coalition governments. 
By making coalition bargaining explicit, they make 
dissolution and elections—whether opportunistic or 
involuntary—a consequence of the strategic interac- 
tion of coalition members rather than the vaguely epi- 
demiological process that dominated earlier govern- 
ment duration literature.’ After all, as Grofman and 
Roozendaal (1994) once commented, it is the choices 
of actors within circumstances created by exogenous 
shocks that bring cabinets down. Election timing by 
single-party governments is considerably simpler than 
dynamic coalition bargaining but much can still be 
gained from an explicit consideration of the actors’ 
choices within circumstances created by exogenous 
shocks. Single-party governments, for example, maxi- 
mize both the duration of the current term and the 
probability of reelection, not, as the traditional govern- 
ment duration literature assumes, just time in office. 
I model this trade-off between extending the current 
term and winning another as a dynamic optimiza- 
tion problem. This is not the first model of strategic 
election timing—signaling games by Smith (1996) and 


8 See Grofman and Roozendaal’s (1997) review article for a history 
of the coalition stability and duration literature 
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Terrones (1989) find equilibria on no early elections 
and early elections, respectively,’ and dynamic pro- 
gramming models by Balke (1990) and Ito (1990) 
yield numerical predictions of election timing—but this 
model, in addition to explicit consideration of manip- 
ulation, offers the first analytic predictions of when 
elections should be called and under which institutions 
they should be most prevalent. 

The remainder of this paper develops, extends, and 
discusses the importance of a dynamic stochastic model 
of opportunistic election timing and economic manip- 
ulation. I begin with a parsimonious election timing 
model illustrating how governments form a finite- 
horizon optimal strategy for calling elections to maxi- 
mize their reelection probability. Governments in this 
initial setup only decide between calling and not call- 
ing an election in each period of their term; they only 
care about reelection; and they increase their reelec- 
tion probability by coordinating elections with favor- 
able, exogenously determined states of the world. A 
subsequent section then builds on the initial model by 
considering utility from office-holding, not just reelec- 
tion. Early elections cost governments the remainder 
of their current term, a trade-off that matters if office- 
holding itself is desired. I next constrain the stochastic 
state variable to economic shocks and make the voters’ 
decision calculus explicit. Other noneconomic events 
certainly matter for election timing, but parsimony, as 
well as the upcoming exploration of how timing sub- 
stitutes for economic manipulation, demands a ceteris 
paribus focus on macroeconomic management. Voters 
in this third section of the model now infer competence 
from the government’s provision of public goods, a gov- 
ernment activity that is constrained by the exogenous 
performance of the economy. The final extension of 
the model, after a brief digression on the intuition of 
the election timing model, allows governments to cir- 
cumvent this budget constraint by increasing current- 
period spending via a distortionary tax on the postelec- 
tion future. Comparative statics then suggest, among 
other results, that surfing does substitute for manip- 
ulation, that countries with more volatile economies 
surf more (and manipulate less), that longer maximum- 
term lengths (CIEPs) increase both the likelihood of 
opportunistic elections and political intervention in the 
economy, and that the value of office holding decreases 
surfing but increases manipulation. 


’ Early elections in Terrones’ model signal high competence because 
a low-competence government, knowmg it would stand little chance 
of reelection, should simply maximize its time m office by postpon- 
ing elections to the last possible moment. Early elections in Smith’s 
model signal low competence because high competence governments 
have greater confidence in therr ability to perform well in the future. 
Later empmncal work by Smith (2003) confirms the spirit, rf not the 
equbbnum, of his signaling model. Bntssh governments call early 
elections but, because early elections signal government foreknowl- 
edge of imminent downturns, voters penalre them for doing so. The 
penalty, however, is not sufficient to preclude opportunistic election 
calling. The stochastic model presented here predicts that govern- 
ments call elections during exceptionally large expansions unlikely to 
recur m the same term. This 1s often, but not always, observationally 
equivalent to calling elections before downturns 
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As a majority of parliamentary democracies allow 
for endogenous election timing, the implications ‘of 
this research are neither obscure nor trivial. A clearer 
understanding of why and when elections are called 
promises broadly applicable insights into electoral pol- 
itics and the political business cycle. 


THE BASIC MODEL 


Election timing is quintessentially a problem of op- 
timization under uncertainty. The incumbent govern- 
ment assesses electoral conditions in each period of 
its term and decides whether to call an election or 
to proceed to the next period, not knowing what that 
period holds. Opportune timing can increase both the 
probability of electoral victory and legislative efficacy 
through greater seat share but also effectively stops 
(i.e., resets) the game. Thus, election timing is fun- 
damentally a finite-horizon optimal stopping problem. 
The government has a fixed number of periods in which 
to call an election before one is imposed and therefore 
times the election to coincide with the most electorally 
advantageous circumstances. 

Recognizing election timing as an optimal stopping 
problem allows one to model a government’s dynamic 
decision process explicitly. Dynamic programming 
techniques, as explained below, permit current-period 
options to be repeatedly compared to the expected 
value of future opportunities. Although the unknown 
values of several parameters prevent us from predict- 
ing the magnitude of different variables’ effects on the 
probability of elections in this model, it is nevertheless 
possible to predict the sign of that effect.1° 

Imagine for now the simplest possible arrangement: 
Voters have no memory of previous periods; the gov- 
ernment’s reelection probability, p, in each period 
te {1,2,3,..., T}, is strictly increasing in the random 
state variable, y, distributed, for simplicity, uniformly 
with density ¢; each draw of yz, is independent of others; 
the value of office-holding is constant over time; and 
governments simply maximize their chance of reelec- 
tion, by timing elections to coincide with the highest u; 
that they believe will occur in their term. 

Given that the government cannot see future states, 
how does it determine that the current state is the 
best that they will likely see before their term expires? 
The government knows the distribution from which 
p is drawn and, as in any stationary Markov decision 
problem, plays a strategy to maximize its payoff: Call 
an election only if the current state exceeds the best 
expected future state. Assuming optimizing behavior 
in each future period allows us to calculate the gov- 
ernment’s expected future state via backward induc- 
tion. Thus, for example, in the penultimate period, 


10 Estumating discrete choice dynamic programming ters 1s 
often posmble, however, and has received considerable attention 
from some econometricians and labor economists. Eckstein and 
Wolpin (1989) offer a survey of several approaches and newer tech- 
niques (¢.g., Keane and Wolpin 1994) continue to emerge. In the case 
of this paper, one possibility might be to employ polling data in place 
of the current stochastic economic shocks 
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t—1, the government’s expected state in the subse- 
quent and final period, E,_1(,), is simply the ares 
value of the random variable u, p ~ U[1 — 31+ 5], 
i.e., one. In this penultimate period the government 
will call an election if the current state of the coun- 
7 H-1, is greater than the expected future state, 

E,-1(u4,) = 1. This logic is then extended to the pre- 
ceding period, t— 2. The expected state continuing 
from +t —2 to t—1 is simply the value of playing an 
optimal strategy at t—1, i.e, the average expected 
state from t — 1’s two possible outcomes over all pos- 
sible values of u: (1) continuing to the final period if 
H-1 < E,_1(4,) [payoff = 1] and (2) calling an elec- 
tion if ur-1 > Er-1(Hr) [payoff Ey-1(ur) | H-1 > 1]. 

Expressed more succinctly, the government’s deci- 
sion at each ¢ solves the program 


1+} 
| 4 max{E;(ur41), Hi} k ay, 
1 1 
—,1 Pn E) 
z +) 


which implies that the value of playing an optimal strat- 
egy at ris 
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or, more explicitly, 
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Thus the expected state from playing an optimal strat- 
egy can be calculated at any period and the expected 
value of continuing is just the value of an optimal strat- 
egy in the subsequent period.” Since E(u141), the ex- 
pected utility of continuing in office, is a monotonically 
decreasing function of time, converging to E(u), Le., 
unity, in the penultimate period, it is least likely that 
current-period circumstances will exceed the expected 
future circumstances at the beginning of a term. As 
remaining tenure wanes, however, the expected future 
state steadily declines, thereby increasing the probabil- 
ity that a current period random draw of u will exceed 
it and trigger an election.” 


11 Keep in mind the distinction between E(u) and E(u) The former 
is simply the expected value of a single draw from the state varable 
(p) distribution, whereas the latter 1s the expected utility of playing 
an optimal strategy at t 

12 Equations (1) and (2) depend on the Bellman optimahty equation 

See Sundaram ([1996] 1999, 274) for an explanation of how a deci- 
sion rule constitufes an optimal strategy in finite horizon dynamic 
programming See Kreps (1990, Appendix 2.2) for a mathematically 
similar optimal stopping example. 

13 The expected utility of continuing declines with time because the 
number of future draws decreases with time. This process, to borrow 
from the extensive optimal search and stopping applications 1n labor 
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OFFICE-HOLDING 


Governments, however, derive utility from many time- 
dependent actions in office, not just from reelection 
per se. In this context, election to office becomes only 
a necessary condition: Governments maximize their 
reelection probabilities in order to extend their time 
in office but must trade off a probability-weighted new 
term against the remainder of their current term. Call- 
ing an election too soon costs the government the utility 
from the remainder of the current term; waiting too 
long reduces the chances of another favorable period 
occurring before mandatory dissolution. Thus a ratio- 
nal incumbent attempts to call elections in the Zast best 
period possible. 

When utility comes from office-holding, the incum- 
bent’s decision becomes 


U(t) = max i l (3) 


Each period in office the government gains one unit of 
utility—consider this an ego rent, R—and receives new 
information about the current state of the country. The 
rational office-seeking incumbent continues in office 
until the expected utility of calling an election exceeds 
the expected utility of continuing in office, knowing 
that future elections may arrive under less fortunate 
circumstances. I represent A, the value of continu- 


ing, as 
à = eH p(El um) R+ -AR (4) 


The second term in the equation, (t — t)R, captures 
the remaining utility in the current term in office and 
shrinks as the term progresses, providing a diminishing 
incentive to forgo favorable election opportunities. t is 
the maximum term length in periods and t represents 
the current period. 

The first term in Eq. (4) represents the other im- 
portant element in determining election timing: the 
expected value of calling an election in the future. 
When incumbents expect favorable circumstances for 
an election in the future, it is easier to forgo current 
opportunities. More precisely, the expected value of a 
future election is the product of the reelection probabil- 
ity generated by the expected future state p|E(u14+1)] 
and the utility of a new term in office, tR, appropri- 
ately time discounted. Against this expected value of 
continuing, politicians must weigh the utility of calling 
elections. In its most basic form the expected utility of 
elections in any period is 


ni = p(u,)tR, (5) 


simply the maximum length of another term weighted 
by the current-period reelection probability, p (me). 


economics, 1s akin to that faced by a job seeker with a fixed number 
of sequential job offers with salaries drawn from the same distri- 
bution. At the final offer the job seeker will have to accepted the 
expected value of a single draw from the distribution, Le., its mean. 
The penultimate offer will therefore only be accepted if it exceeds 
the expected value of the final offer, and so on, backward to the first 
offer. 
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VOTERS 


Until now, the government’s reelection probably has 
simply been defined as a function of the state vari- 
able without explicit consideration of how voters form 
their preferences. I now make the voters’ preference 
mechanism explicit with respect to economic welfare. 
I redefine the state variable u as a stochastic shock to 
the economy to generate a tractable model with budget 
constraints that can later trade off economic manipu- 
lation and election timing. This is not to claim that 
noneconomic events do not affect election timing— 
they do—but simply to restrict this model to ceteris 
paribus effects of the economy. 

Adopting a career concerns framework 
(Dewatripont, Jewitt, and Tirole 1999a,b; Holstrém 
1982; Persson and Tabellini 2000) positing asymmetric 
information, J can now make the voters’ selection 
process—and hence the government’s reelection 
probability calculation—explicit. Voters hold govern- 
ments accountable for economic performance but, 
unable to observe government competence directly, 
infer it from the provision of public goods. More 
explicitly, suppose that government provision of public 
goods, g;, is constrained by 8: = z,( Ty + s,), where 
T represents the tax rate,’* y, income; Z, aggregate 
two-period economic shocks; and s,, a hidden and 
distorting tax that shifts resources from the future 
to the present, improving current period welfare 
at the cost of the equivalent amount plus negative 
economic distortions, V(s), in the subsequent period. 
Knowing that more competent governments provide 
more public goods, but impotent to influence p, 
governments are tempted to create the appearance 
of greater competence by increasing s. While voters 
can immediately observe g, T, and y, both z and s 
remain at least partially unobservable (primes indicate 
inferred variables). Voters therefore infer overall 
government competence as z= g,/(Ty+s(). Past 
aggregate competence, Z%-—1, is both observable and 
related to present competence but the current period’s 
competence cannot be observed directly. Imagine 
aggregate competence, z, as the sum of the random 
and serially uncorrelated economic shock variable uz 
over the present and immediately preceding period, 
Z; = H-1 + Hr, Where ui is observed by government 
and voters alike but knowledge of ur is reserved for 
the government. Given that the random variable p 
is distributed cue with mean one and density 
gu ~ U1—+,1+4+ £], voters choose the government 
if its inf current competence u; is greater than 
the opposition’s expected competence, E(j19) = 1.1516 


14 The tax rate, T, remains fixed in this model m the mterest of 
simplicity. Transforming T mto a choice vanable would mandate a full 
welfare function for voters and unnecessary complenity. With T fixed, 
povorimenig may only choose between surfing and manipulation. 
Unity is chosen amply to simplify later algebra and has no sub- 
stantive effect on the model’s comparative statics Objections that 
government competence increases welfare beyond the government’s 
resources are easily met by reducing E(u) to one-half or less 
ló Note that variables not directly observed by the electorate are 
denoted by prime. 


Thus, the government’s reelection probability is 
0 if ped, 
PeS , ; 
1 if Ly > 1; 
simply the probability that its inferred current compe- 


tence exceeds the opposition’s expected competence. 
More explicitly, 





&r 
= Prin iPr — Hm> 1 
mz =Pr| miei] © 
or, reexpressing g; and rearranging, 


Ppi = Pr |F u + fr) — He- È 1). (7) 
Keen observers will notice the possibility of 
bounded rationality. Voters base their assessments 
of government competence—and hence election 
preferences—on economic performance, yet govern- 
ments have little direct influence over the short-term 
performance of the macroeconomy.!” Assuming some 
price stickiness, as some recent real business cycle 
models attempt (cf. McCallum 1999), builds in a lag 
on inflation that then resembles the model offered 
here: Governments can provide real short-term stim- 
ulus but only at the cost of future distortions. In the 
absence of such price stickiness, governments would be 
constrained—as in political budget cycle models—to 
fiscal manipulation, which obviously also generates 
an intertemporal transfer. Thus, I continue to assume 
that manipulation is only possible in combination with 
costly transfers from the future. but that, consistent with 
empirical political science, voters nevertheless reward 
and punish governments on the basis of economic per- 
formance (Lewis-Beck and Stegmaier 2000). 

Voters, therefore, may be fully rational or boundedly 
rational, depending on the cost to them of the govern- 
ment influencing the economic variable from which 
they infer competence. In this model the government 
can Only signal competence by increasing public goods 
provision, g, either though opportune timing (hence a 
higher z) or by increasing s, a hidden and distortionary 
tax on the future. Although exogenous to the model, 
if the value to the voters of the competence signal in- 
ferred from g exceeds the time-discounted future dis- 
tortionary effects of s, then the voters are fully rational 
in much the same way they are in rational political 
budget cycle models (e.g., Rogoff 1990). Alternatively, 
if the value of the competence signal does not exceed 
that of the discounted future distortions, voters are 
boundedly rational. Empirically, there are numerous 
examples of voters holding politicians accountable for 
acts of God that they can at best mitigate but cannot 





17 See the debates over monetary policy ineffectiveness in the real 


business cycle literature launched by Kydland and Prescott (1980, 
1982) following the rational expectations contributions of Lucas 
(1972) and others. Note that the standard assumption ın the real 
business cycle literature 1s that the cyclical real effects of monetary 
policy are very small or nonexistent. Empirical support for the policy 
ineffectiveness assumption 1s provided, among others, by Litterman 
and Weiss (1989). 
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FIGURE 1. 


Election Calling Over Time 
50 


prevent.'8 This behavior may even induce governments 
to engage in costly acts of signaling that reduce future 
welfare. The framework developed here is sufficiently 
general to accommodate either assumption about voter 
behavior. 

Finally, consistent with this discussion, note that re- 
election probabilities are increasing in yw, and s; in 
Eq. (7), providing governments with an incentive to 
manipulate the economy in election periods for elec- 
toral advantage. To provide more realism (and an in- 
terior solution), I explore the derivation of optimal 
manipulation, s*, below; but first let us consider the 
model’s broader intuition. 


THE INTUITION 


Thus far I have established a framework for how gov- 
ernments calculate reelection prospects and how voters 
assess povernment competence. In brief, governments’ 
expected future election-period competence dimin- 
ishes as the remaining time (and hence election-calling 
opportunities) wanes; voters infer competence and se- 
lect candidates based on their own welfare; and govern- 
ments repeatedly compare current and expected future 
utility in deciding when to call an election. Temporarily 
disallowing economic manipulation and adapting the 
government’s election-calling decision to a fully dy- 
namic setting in which future periods within the present 
term are discounted relative to more immediate peri- 


18 The recent paper by Achen and Bartels (2002) showing how New 
Jersey voters punished local politicians for shark attacks (among 
other examples) is an amusing illustration of this. Susan Hansen 
(1999) shows that even competent state governors are often punished 
by voters for national economic trends such as unemployment over 
which they have little influence. 
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ods now allows us to strengthen the model’s intuition by 
simulating an incumbent’s period-by-period election- 
calling decision. 

A period-by-period decision process requires us to 
calculate the present value of each term’s ego rent 
“revenue stream” at the time of each election calling 
decision, i.e., at each t. Not only do governments view 
future terms as less valuable than the present term, 
but within a given term, a distant period is less valu- 
able than the current period. Hence, the government’s 
election-calling decision should be expressed as 


K f Epu)Re™ di 
T 
max + Re™ dt (continue) . (8) 
t 
/ pRe “dt (call) 
0 


This decision is best presented graphically. Figure 1 
simulates the calling (solid line) and continuing (dotted 
line) decision over time, assuming (= 3, 6 = .02, and 
t = 60. Obviously, when calling exceeds continuing, an 
election is called. Toward the beginning of a term, the 
time remaining in the current term, t — t, together with 
high expected future popularity, E(p:+1), ensures that 
the value of continuing in office exceeds the value of 
calling elections. As time in office progresses, however, 
both + — t and E(p;41) diminish, lowering the value of 
waiting to the range where an exogenous stochastic 
event may make a snap election the more appealing 
option. From the government’s perspective, when A 
exceeds n, circumstances are better than expected and 
likely only to deteriorate in the future. 
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FIGURE 2. Election Calling, Long Voter Memories 


20 30 40 
Period 
(C=3; 6 = .02; m =.7; t=60) 


What if voters’ affections are not so fickle? Although 
extraneous to the model at hand, it is not difficult to 
simulate voters who recall previous events and up- 
date their assessment of government competence more 
slowly. Constructing p as a linear combination in which 
past reelection standing is combined with new events 
in proportion to the strength of voter memory does not 
change the model’s fundamentals. Suppose that 


Pr=mp.41+0- m)D;» (9) 


where m, m € [0, 1], captures the strength of the previ- 
ous period’s influence on current reelection prospects 
and p; represents the current period instantaneous 
reelection probability, i.e., the government’s chance 
of reelection if voters base their decision on only 
the current period’s events. At higher levels of m, 
new developments affect reelection prospects less and, 
obversely, lower levels of m weight present events 
more. 

Voter memory reduces the volatility of government 
reelection probabilities and consequently may lower 
the likelihood of early elections. Figure 2 (¢= 3,6 = 
.02, m = .7, and t = 60) illustrates this: n, the value of 
calling at each f¢, is visibly less volatile than in Figure 1. 
A broad definition of collective voter memory, such 
as one that includes education levels and the indepen- 
dence and quality of the press in addition to innate 
human memory, would imply that societies with higher 
(lower) levels of education and a better (worse) press 
may be more (less) resistant to opportunistic election 
calling. Speculation aside, both Figure 1 and Figure 2 
illustrate the model’s fundamental intuition: A high 
initial value of continuing descends into a range where 





it may be exceeded by stochastic competence shocks 
as a government ages. 


OPTIMAL MANIPULATION 


As the model is currently constituted, reelection- 
minded governments should simply increase economic 
manipulation until their probability of reelection ap- 
proaches unity, engendering an economically destruc- 
tive vicious cycle as expectations also adjust upward. As 
voters do not witness the distortionary effects of s un- 
til after the election, governments should consistently 
inflate the economy beyond expectations to ensure 
victory. 

However, a government that cares about voter wel- 
fare, or, more cynically, its own reputation and future 
reelection bids, will avoid this corner solution by trad- 
ing off reelection probability and voter welfare. Pre- 
election manipulation induces postelection penance in 
the form of (1) lower government revenue, (2) lower 
public goods provision as gm1 = %41(Ty —5;), and 
(3) negative distortionary effects, V(s,), such that 


wmi = YL — T) — Vina (Sr) + g1. (10) 


In short, governmental sins of the past revisit the vot- 
ers’ present as the distortionary effects of previous 
period manipulation and lower levels of public goods 
as the government repays election period obligations. 
Incumbents thus maximize a combination of reelection 
conditional utility and voter welfare in choosing the 
optimal level of economic manipulation, 


max ptR + E(uy41), (11) 
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or, equivalently,’ 








tR s,—S, ae 
ae eg ae te 
max -> + | eo MBs + Ely — T) 


— Vaa (Se) + (er + rg CTY — 51)], (12) 
which, taking expectations, E(j,41) = 1, and retaining 





only necessary subscripts, yields 
aU Hi S +T — hs + Heas | 
a i a dl a 
as E Fs FY 
-V (s)- m- 1. (13) 


Multiplying through by the inverse of V’(-) and setting 
s’ = s (in equilibrium) provides the FOC: 


Rīg(u-1 +1) wy +1 
Vilyt+s) V 


With a little help from the implicit function theorem, 
one can now see that optimal manipulation is decreas- 
ing in competence: 


Os PS EE 
au, Rrt(wy1+1)+WV(+s)? 


Governments, concerned with their legacies and con- 
stituents’ welfare, are not willing to induce too severe a 
postelection economic distortion in return for marginal 
increase in reelection probability. While strong incen- 
tives exist to manipulate the economy for electoral gain 
in election periods, the degree of such intervention is 
tempered by concerns about economic repercussions. 





+5. (14) 


(15) 


COMPARATIVE STATICS 


So what does this mean for election timing? The com- 
parative statics of this model bear strong implications 
for optimal election timing. Understanding the effect 
of t, R, & u, and å on p has—as I show below—direct 
implications for optimal timing and, consequently, in- 
stitutional design. We will also see that opportunistic 
timing, in turn, has its own implications, especially in 
relation to manipulation. 

A single observation, posed as a proposition below, 
allows for relatively simple comparative statics on elec- 
tion timing. 


Proposition 1. Let E(teec) be the expected period 
in which elections are called under optimal election 
timing. Then E(tetec) is strictly decreasing in E(p,) if 
e XY E(pyi1)/E(p1) < 1. 


19 Solving expression (7) for u; yields the critical value, wet = 
(3 + Ty + iuf — y—-15)/(Ty +5). p: is the area in the y distribution 
where u > u", that is, py = ¢[1 + (1/29 — w™] or, more expheatly, 
Pr=4+ lies ~ l- fy) Person and Tabellmi (2000) offer an 
excellent explanation of probabihstic voting techniques The 
expectations term represents expected welfare in the perlod follow- 
ing the election period. I assume that revenues, s;, borrowed in the 
election penod must be repaid, thereby reducing the provision of 
public services in the succeeding period, gr41. 
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Given Eq. (8), any increase in E(p,), the central com- 
ponent in calling, yields a greater increase in E(p;+1), 
the primary component in continuing, in all periods 
prior to t — 1. The expected value of playing an op- 
timal strategy over multiple future periods is greater 
than the (expected) value of the single draw in the 
current period, so any increase in single draw E(p,) 
is amplified in E(p,41). This implies that the value of 
continuing increases in E(p,) at a faster rate than the 
value of election calling, yet as calling is immediate and 
continuing deferred, only the latter is time discounted. 
As long as a government discounts the future at a rate 
sufficient to offset the rate at which E(p,,1) surpasses 
E(p,), then d(a — n)/dE(p;) < 0.” 

Given that e*'-E(p,41)/E(p;) < 1 obtains, it is 
now possible to draw a number of conclusions about 
the effect of several societal and institutional features 
on election timing and manipulation. Proposition 1 
implies that opportunistic timing increases (i.e., à — n 
decreases) in 


1. u. Governments cannot influence short-term eco- 
nomic conditions (short of distortionary manipula- 
tion) but they are able to call elections to correspond 
with opportune circumstances. Timing elections to 
correspond to periods of better economic perfor- 
mance (greater u) increases public goods provision, 
the electorate’s estimate of government compe- 
tence, and the incumbent’s current period reelection 
probability. Opportunistic elections should there- 
fore be more likely in periods of exceptionally 
strong economic performance. 

2. 6. The more a government discounts the future, the 
more A, the value of continuing, is reduced relative 
to n, calling. This suggests that minority govern- 
ments or governments with narrow majorities or low 
party discipline—that is, any government with low 
expectations of staying in power for long—should 
be more inclined to opportunistic timing. 

3. t. Longer maximum terms postpone the expected 
election period, taec, but increase opportunistic elec- 
tion calling by decreasing à — n. The prior occurs 
because the remaining term sacrificed by early elec- 
tions increases; the latter occurs because longer 
terms raise the value of a piven term, thereby in- 
voking proposition 1. Opportunistic election calling 
should therefore be more common in states with 
longer maximum terms. 


2 The condition that e“*-9 E(p,.1)/E(pr) < 1 is very reasonable. 
For example, even near the end of a term where dE(p;41)/dE(p;) 18 
greatest and the exponential discounting slightest, given p ~ U[}, 7] 
and three remaining periods prior to mandatory elections, any ê 
greater than 02 suffices to ensure that an increase in E(p) will 
raise the utility of callmg more than continuing. In the preced- 
mg period, r—4, the discount threshold falls to .018, and by the 
first period of a 60-period term (think of monthly periods in a 
five-year term), any value of å greater than .002 will ensure that 
eX") F(p.41)/E(p,) < 1 and therefore d( — n)/dE(p) < 0 Thus, 
anything that raises E(p,) yields more opportunistic election calling 
and earlier elections 1f e*"*—) E(p4,1)/E(p;) < 1 and reduces oppor- 
tunistic election calling and extends expected 


government tenure if 
eK E(p141)/E(pr) > 1 
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But opportunistic timing decreases in 


4. R. The greater the value of office-holding, the less 
opportunistic the government. An increase in the 
value of office-holding (fewer checks on power, 
weaker opposition, etc.) is reduced by reelection 
uncertainty in the calling function common to A and 
n but is unmodified in the remainder of the present 
term unique to A. 

5. ¢ The higher the density of u, the lower the vari- 
ance of u, the lower the probability of a draw of 
u sufficiently above E(u) for n to exceed A. Stated 
differently, the frequency of opportunistic elections 
is positively associated with the variance of eco- 
nomic performance. Opportunistic election timing 
should therefore be more common in countries with 
volatile economic performance, such as developing 
countries and developed countries with sectorally 
concentrated economies. 


Similarly, Eq. (14) shows that manipulation, s*, in- 
creases in 


6. R. A greater value of office-holding increases 
the optimum level of reelection motivated inter- 
vention in the economy. Stronger governments 
should therefore be expected to manipulate their 
economies more, not just because they can, but be- 
cause they have a stronger incentive to do so. 

7. t. Longer maximum term lengths increase the value 
of office-holding and the level of election-motivated 
economic manipulation. All else equal (especially 
election timing), economic manipulation should be 
greater in states with longer maximum terms. Since 
states with longer term lengths are also more likely 
to reduce manipulation by calling opportunistic 
elections, empirical testing requires careful selec- 
tion. 

8. ¢ A higher density in the distribution of exogenous 
economic shocks implies more manipulation. Imag- 
ine an infinite density so that every draw of u equals 
E(u); then only manipulation would remain as a 
means of increasing perceived competence. Manip- 
ulation should therefore be greater in states with 
stable economies that, by varying little, offer few 
benefits from opportunistic timing. 


But manipulation decreases in 


9. u. Positive exogenous economic shocks substitute 
for economic manipulation, lowering the need for 
distortionary manipulation by raising the reelection 
probability. Elections held in periods of exceptional 
economic performance should therefore be associ- 
ated with less economic manipulation. 


Finally, it is now possible to see that greater eco- 
nomic shocks increase opportunistic timing but (recall- 
ing Eq. [15]) decrease manipulation. Thus, surfing and 
Manipulation are inversely related in u. Opportunistic 
timing effectively makes x, an exogenous random vari- 
able, a choice variable. Institutional arrangements that 
enable opportunistic timing enable a higher election 
period u and consequently reduce manipulation. 


CONCLUSION 


With the exception of three elections called in the first 
19 months in office by governments hoping to improve 
weak parliamentary positions, none of the 16 British 
general elections since the Second World War have 
been called before 40 months, two-thirds of the max- 
imum five-year term (Butler 1995 [1989]; Keesing’s). 
Yet only two governments (under Alec Douglas-Home 
in 1964 and John Major in 1997) have run their en- 
tire term. New Zealand exhibits a strikingly different 
pattern: There, it is extremely rare for a parliament 
not to run its entire three-year course, although op- 
portunistic elections are clearly allowed. Since World 
War Two, only 3 of 20 New Zealand parliaments have 
been dissolved early (by Holland in 1951, Muldoon 
in 1984, and Clark in 2002). In Great Britain politi- 
cal business cycles in macroeconomic aggregates are 
largely absent; in New Zealand exceptional economic 
growth prior to elections has been highly apparent 
(cf. Alesina, Roubini, and Cohen 1997). The compar- 
ative statics derived in this paper broadly correspond 
to these empirical patterns. States with longer maxi- 
mum terms, such as the United Kingdom, should ex- 
perience more frequent opportunistic election calling 
than their shorter-term peers, such as New Zealand. It 
then follows that countries with more opportunistically 
timed elections, such as the United Kingdom, should 
also have less evident political business cycles than 
less opportunistic countries like New Zealand. This 
paper provides some tentative explanations for these 
observations and suggests the existence of additional 
regularities. 

More specifically, I have found that surfing increases 
in exogenous economic performance, in future elec- 
toral uncertainty, in the maturity of a parliament, in 
the maximum length of a term and in the variance of 
economic performance but decreases in the value 
of office-holding. Manipulation increases in the value 
of office-holding and in the maximum length of a term 
but decreases in economic performance and in the vari- 
ance of shocks to voter welfare. Importantly, better 
economic performance increases opportunistic timing 
but diminishes election-motivated economic manipula- 
tion, implying an inverse relationship between surfing 
and manipulation. 

Earlier work has highlighted the endogeneity bias 
inherent in empirical tests for political business (and 
budget) cycles when opportunistic election calling 
is permitted but not modeled (e.g., Heckelman and 
Berument 1998). Yet this 1s the first paper to delineate 
the relationship between surfing and manipulation. 
This paper explains why, where, and to what magni- 
tude opportunistic timing should affect manipulation. 
Where governments have the least influence over 
their domestic economy and economic performance 
is volatile, endogenous election timing may reduce the 
manipulation the most. The tempering effect of surfing 
on manipulation may be greatest in trade-exposed de- 
veloping countries with volatile terms of trade. Mexico, 
for example, might have avoided its notorious cycle of 
preelection currency overvaluation and postelection 
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devaluation if it permitted endogenous election tim- 
ing. Canada, where some newspapers and the current 
opposition are calling on the federal government to 
follow British Columbia and Ontario’s recent adoption 
of fixed election timing, might be wise to consider the 
implications for preelection macroeconomic manage- 
ment. 

The implications for election timing per se are no 
less important than those for the political business cy- 
cle. The model suggests that most single-party govern- 
ments should be highly opportunistic in calling elec- 
tions. Governments able to coordinate elections with 
an expanding economy should enjoy greater incum- 
bency advantage than their fixed-timing peers. More- 
over, constraints on possible tools of monetary and 
fiscal manipulation—for example, the European Eco- 
nomic and Monetary Union, increasing capital mobil- 
ity, the European Union’s growth and stability pact, 
or simply a growing share of exports in GDP—should 
only increase this difference. 

Finally, this paper provides a central role for strategic 
decision making in research on government termina- 
tion. Traditional empirical research on government ter- 
mination, constrained by the limits of discrete time and 
hazard models, has largely ignored strategic behavior 
(e.g, Warwick 1994). Formal work has incorporated 
strategic behavior but has been either numeric (e.g., 
Balke 1990) or not fully dynamic (e.g., Lupia and Strom 
1995). The present dynamic stochastic model, together 
with advances in the estimation of dynamic stochastic 
discrete choice models (e.g., Keane and Wolpin 1994), 
should offer new opportunities for predicting and esti- 
mating the effects of various political, institutional, and 
economic arrangements on surfing and manipulation. 
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Accountability and Abuses of Power in World Politics 
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ebates about globalization have centered on calls to improve accountability to limit abuses of 

power in world politics. How should we think about global accountability in the absence of global 

democracy? Who should hold whom to account and according to what standards? Thinking 
clearly about these questions requires recognizing a distinction, evident in theories of accountability at 
the nation-state level, between “participation” and “delegation” models of accountability. The distinction 
helps to explain why accountability is so problematic at the global level and to clarify alternative pos- 
sibilities for pragmatic improvements in accountability mechanisms globally. We identify seven types of 
accountability mechanisms and consider their applicability to states, NGOs, multilateral organizations, 
multinational corporations, and transgovernmental networks. By disaggregating the problem in this 
way, we hope to identify opportunities for improving protections against abuses of power at the global 


level. 


business, expansion of the scope and authority 

of multilateral organizations, and rapid increases 

in the number of nongovernmental organizations 
(NGOs) have heightened concerns about the way 
power is used and abused on the world stage. Contro- 
versy about globalization pivots around the twin issues 
of accountability and democracy. A crucial feature of 
representative democracy is that those who govern are 
held accountable to the governed. If governance above 
the level of the nation-state is to be legitimate in a 
democratic era, mechanisms for appropriate account- 
ability need to be institutionalized. Yet these mecha- 
nisms cannot simply replicate, on a larger scale, the 
familiar procedures and practices of democratic states. 
We seek in this article both to explain why account- 
ability is so problematic at the global level and to 
suggest pragmatic improvements in accountability 
mechanisms. Our analysis considers accountability as 
only one of several ways in which power can be con- 
strained, examines nondemocratic accountability me- 
chanisms as well as democratic ones, and scrutinizes 
the analogy between global and domestic power struc- 
tures in order to specify the nature of the problem of 
accountability in global politics more clearly. We distin- 
guish two basic concepts of accountability: delegation 
and participation. We explore accountability mecha- 
nisms of both types that could be utilized to regulate 


Te interdependence of states, globalization of 
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global powers in a manner compatible with democratic 
principles (see Held 1995 and Young 2000, chap 7). 

Our analysis helps to resolve a puzzle that is posed 
by conventional views of accountability. The prevailing 
view in the international relations literature of mul- 
tilateral organizations, such as the World Bank and 
World Trade Organization (WTO), is that these entities 
are weak relative to states. But critics of global- 
ization view such organizations as relatively uncon- 
trolled, criticizing them as “unaccountable” while cel- 
ebrating the democratic accountability of states (Dahl 
1999). Thus, multinational organizations are character- 
ized as both weak and unchecked at the same time. 
We argue that multilateral organizations are in fact 
accountable—indeed, more accountable in many re- 
spects than powerful states—but in ways quite differ- 
ent from those envisaged by observers who equate ac- 
countability with participation. Similarly, our analysis 
helps to explain why the parties to the debate over 
global accountability so often seem to be “talking past 
each other.” To officials of the World Bank, it is the 
NGOs who seem accountable to nobody, whereas the 
officers of the Bank must answer to their supervisors 
and ultimately to the states that empowered them. To 
the representatives of the NGOs, it is the World Bank 
that lacks accountability because it does not answer 
to those affected by its policies, the very people for 
whom the NGOs claim to speak. An agreement on the 
need for greater accountability will not resolve this dis- 
agreement without a consensus on what accountability 
entails. 

Accountability, as we use the term, implies that some 
actors have the right to hold other actors to a set of 
standards, to judge whether they have fulfilled their re- 
sponsibilities in light of these standards, and to impose 
sanctions if they determine that these responsibilities 
have not been met. Accountability presupposes a re- 
lationship between power-wielders and those holding 
them accountable where there is a general recognition 
of the legitimacy of (1) the operative standards for ac- 
countability and (2) the authority of the parties to the 
relationship (one to exercise particular powers and the 
other to hold them to account). The concept of account- 
ability implies that the actors being held accountable 
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have obligations to act in ways that are consistent with 
accepted standards of behavior and that they will be 
sanctioned for failures to do so. 

Thus, not all constraints on abuses of power in world 
politics constitute mechanisms of accountability. Uni- 
lateral uses of force, though they are often described 
as “holding someone accountable,” do not qualify as 
accountability mechanisms in our sense. In the clas- 
sical, European balance of power system,! the prin- 
cipal mechanism of constraint was, ultimately, coer- 
cion or the threat of coercion. States were the exclu- 
sive sources of legitimate authority, and though they 
could legally bind themselves through international 
treaties, if treaties were broken, states had to resort 
to self-help to assert their rights. States in a balance of 
power system are not “held accountable” in any mean- 
ingful sense, although they may be constrained by co- 
ercion or the threat thereof Similarly, the economic 
interdependence of states creates constraints, but states 
that are bargaining for advantage on economic issues 
are not necessarily held accountable to one another. 
States in the nineteenth century engaged in bilateral 
tariff negotiations, confronting limits on their ability 
to achieve their objectives. But since they had under- 
taken no prior obligations by joining multilateral insti- 
tutions, there was no accountability process involved. 

There is another important mode of constraining the 
powerful that must be distinguished from accountabu- 
ity. “Checks and balances” are mechanisms designed to 
prevent action that oversteps legitimate boundaries by 
requiring the cooperation of actors with different insti- 
tutional interests to produce an authoritative decision. 
Accountability mechanisms, on the other hand, always 
operate after the fact: exposing actions to view, judging 
and sanctioning them. The executive veto power in 
the U.S. Constitution is part of a system of checks and 
balances. The impeachment power is an accountabil- 
ity mechanism. Of course, though they always operate 
ex post, accountability mechanisms can exert effects 
ex ante, since the anticipation of sanctions may deter 
the powerful from abusing their positions in the first 
place. 

At a minimum, institutions of governance should 
limit and constrain the potential for abuse of power. 
The problem of abuse of power is particularly serious 
in world politics, because even the minimal types of 
constraints found in domestic governments are absent 
on the global level. Not only is there no global democ- 
racy, but there is not even an effective constitutional 
system that constrains power in an institutionalized 
way, through mechanisms such as checks and balances. 
Lacking institutionalized checks and balances, the prin- 
cipal constraints in world politics are potential coercion 
(as in the balance of power) and the need for states 
and other actors to reach mutually beneficial agree- 
ments. But these constraints are quite weak in restrain- 
ing powerful actors, and they are not institutionalized 


1 “A type of system for the conduct of relationships among states” 
that operates to restore equilibrium when it is threatened by a single 
state, or bloc of states, becoming overwhelmingly powerful (Claude 
1962, 41-3) 
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in generally applicable rules. Accountability mecha- 
nisms of various kinds—whether strictly democratic or 
not—are therefore also required if abuses of power are 
to be limited in nonarbitrary ways that can be defended 
on ethical as well as practical grounds. 

Thus, our focus in this article is on the role of ac- 
countability mechanisms in world politics. What kinds 
of accountability mechanisms are likely to be effec- 
tive in constraining international organizations such 
as the World Bank and International Monetary Fund 
(IMF)? Should we think about accountability in the 
same way when considering powerful entities that are 
structured quite differently from multilateral organi- 
zations—imultinational corporations, NGOs, transna- 
tional organizations, and states? How should we think 
about global accountability when there is no global 
democracy? Accountability mechanisms in world pol- 
itics are not limited to those that are emphasized in 
participatory models of democracy, and even these 
mechanisms should be viewed, not in isolation, but in 
the context of other constraints on the abuse of power. 
How can understanding accountability at the level of 
the nation-state clarify the problem of accountability 
at the global level? 


TWO GENERAL MODELS OF 
ACCOUNTABILITY FOR NATION-STATES 


Accountability functions to expose and sanction two 
sorts of abuses: the unauthorized or illegitimate ex- 
ercise of power and decisions that are judged by ac- 
countability holders to be unwise or unjust. In politics, 
ensuring accountability requires establishing institu- 
tions that provide information to those people trying 
to hold power-wielders accountable and that enable 
them to impose sanctions on the power-wielders. “To 
be accountable means to have to answer for one’s ac- 
tion or inaction, and depending on the answer, to be 
exposed to potential sanctions, both positive and nega- 
tive” (Oakerson 1989, 114). Information and sanctions, 
however, are necessary, but not sufficient, conditions 
for accountability. They presuppose norms of legiti- 
macy that establish, not only the standards by which the 
use of power can be judged, but also who is authorized 
to wield power and who is properly entitled to call the 
power-wielders to account. 

In an effort to clarify different forms of account- 
ability, we describe two theoretical models of account- 
ability. The distinction between them is grounded in 
alternative conceptions of the legitimacy of politi- 
cal authority, which have different implications for 
how accountability is understood. Each of the mod- 
els represents a schematic distillation of a line of ar- 
gument that has been developed over centuries. One 
can find similar views competing in debates between 
Federalists and Antifederalists over the U.S. Constitu- 
tion, for example, as well as in contemporary debates 
over whether international organizations such as the 
World Bank should be held accountable to the states 
that authorized their creation or to poor people who 
are most affected by their policies. We have labeled the 
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TABLE 1. Two General Models of Accountabllity for Nation-States 
Power-Wlelders Regarded as Who Is Entitled to Hold the Powerful Accountable? 
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Instrumental agents 


pa 

la. “Direct democracy”: Actions of 
power-wielders are what those 
affected (the people) instructed 


Those Entrusting Them with 
Powers—Delegation 
lla. Princlpa-Agent: Power-wielders 
act as falthful agents of principals 
who empower them. 


them to do in thls contingency. 


Discretionary authonties 


Ib. Popullst: Policies followed by the 
power-wielders lead to outcomes 
approved by those affected, 


IIb. Trustee: Power-wielders 
perform the duties of thelr 
offices faithfully. 


which leads those affected to 
confer additional powers. 


models a “participation” model of accountability and 
a “delegation” model of accountability.” 

Our two basic models differ fundamentally in their 
answer to the question: “Who is entitled to hold the 
powerful accountable?” In the participation model, the 
performance of power-wielders is evaluated by those 
who are affected by their actions. In the delegation 
model, by contrast, performance is evaluated by those 
entrusting them with powers. In addition, each model 
has two variants distinguished by different understand- 
ings of the relation between the powerful and the 
publics they are meant to serve. Power-wielders are 
viewed either as instrumental agents of the public or as 
authorities with discretion. The result is a 2 x 2 table, 
Table 1. 

Both distinctions in the table are significant. The 
columns reflect the familiar distinction between demo- 
cratic participation and populist theories, on the one 
hand, and theories emphasizing consent, legal autho- 
rization, and office, on the other. In the latter theories, 
it need not be the people as a whole, but could be some 
elite group or institution, that entrusts power-wielders 
with power. The rows distinguish between direction 
and control. As emphasized above, all institutional ar- 
rangements for accountability include an element of 
ex post control. In the second, “discretionary authori- 
ties,” row, reliance is placed entirely on such ex post 
mechanisms. Those who entrust authority to others 
do not expect to direct the power-wielders’ behavior 
beyond defining official duties. Instead, they expect 
to limit the abuse of power and control the power- 
wielders either through a system of checks and bal- 
ances or through ex post monitoring and imposition of 
penalties (such as loss of office) when performance is 
not satisfactory. In the “instrumental agents” row, on 
the other hand, mechanisms to direct the actions of 
power wielders ex ante are also employed. Only when 


2 These terms are a kind of “shorthand,” and the reader should take 
their meaning from the discussion that follows. Terminology varies 
in discussions of models of democracy or representation. Contrast 
Dahl (1956), who speaks of “Madisonian democracy” and “populist 
democracy.” See Krouse 1982 for the tension between “elite” and 
“participatory” democracy in J. S. Mill's writings. See also Arendt 
1963, 237. 





the instructions given to agents are not obeyed is there 
a need for accountability mechanisms. 

We begin with the participation model in the first 
column of Table 1. A fundamental principle here is 
that, ideally, individuals ought to be free to make de- 
cisions for themselves, since nobody can both know 
and care for your interests as well or as much as you 
do. A second principle is that people should be treated 
equally. It follows that where collective decisions are 
required, each person ought to have an equal say. 
Legitimacy depends on full participation. Further, 
public power is legitimate only to the extent that de- 
cisions serve the interests of the people as a whole, 
which is interpreted to mean to the extent that the 
outcomes of decisions reflect what individuals de- 
sire. For these reasons, people with power ought to 
be accountable to those who are affected by their 
decisions. 

On the basis of these principles, direct democracy 
could be defended as the ideal form of government, 
preferable to representative government in principle, 
though that is impossible in most circumstances for 
reasons of scale. For this reason, we refer to this model 
as “direct” democracy (Ia). In this model, as the clos- 
est approximation to the ideal, representatives ought 
to reflect their constituents’ interests and beliefs, and 
this is most likely when representatives are similar to 
their constituents demographically and when represen- 
tative assemblies are large.* Referenda are desirable 
tools in this model, as are term limits and frequent 
elections. Executive officers of government are often 
seen as mere instrumental agents of the legislature with 
little leadership role. High levels of participation in 
politics and in governing institutions are considered 
highly desirable and serve as a direct accountability 
mechanism. Representatives and officeholders need to 
be called to account by the governed, who can have 


3 Pitkin (1967, 146-47) makes a similar distinction between mandate 
and independence theories of representation. 

* Similar 1deas were expressed by the antifederalists during the con- 
test over ratification of the U S. Constitution; see Brutus, the Federal 
Farmer, and Melancton Smith, in Stonng 1981, 39, 114-15, 336- 
37, 340-41, 344-45. For a contemporary description see Cook and 
Morgan 1971, 1-42. 
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some control over the decisions that affect their lives 
through political participation.’ 

The populist alternative (Ib) is also a form of govern- 
ment consistent with the underlying principles of the 
participation model, but one in which there is a much 
greater role for the political leader. In this model, the 
people entrust a leader or a party to speak for the 
interests of the people as a whole against groups in 
society that are understood to be “special interests.” 
Direct participation of the people in governing institu- 
tions is not seen as a primary goal. But the legitimacy 
of the party and its leader depends on the extent to 
which they can credibly speak for the people. Thus, the 
populist leader and party are held accountable to the 
public through frequent appeals to mobilized public 
opinion and through elections that serve as referenda 
on the leader’s or the party’s performance in office 
(Taggart 2000, esp. 65-66). 

The delegation model and its variants (Ia and Ib), 
on the other hand, are grounded in the notion that 
power is legitimate only when it is authorized by the le- 
gitimating consent of those who delegate it. And since 
power is always delegated for a reason, it is legitimate 
only so long as it serves its original purposes, which, in 
the case of the nation, are the protection of nghts and 
the pursuit of the public good. The act of delegation 
is what distinguishes authority from raw power. No- 
body can claim authority over another on the basis of 
personal privilege, wealth, hereditary right, or superior 
force. Those in power hold offices with specified duties 
so that power is not personal but, instead, is associated 
with the authority of office. The central principle of 
accountability here, implied by this conception of po- 
litical legitimacy, is that people with power ought to be 
accountable to those who have entrusted them with it. 
And the standard for recognizing abuses of power will 
be violations of that trust: acting beyond the authority 
of the office or in violation of its purposes (Locke [1690] 
1980, esp. par. 149, 151). 

According to the delegation model but not the par- 
ticipation model, representation is superior to direct 
democracy. One reason is that representation has ad- 
vantages for accountability. It is often said that repre- 
sentation is advantageous because people can delegate 
their power to those most able to govern. It is less often 
noted that representation allows for a separation or dis- 
tance between the governed and their governors that 
allows the former to call the latter to account. When 
particular powers are delegated to officeholders and 
representatives, it is clear whom to blame. And when 
the people are distanced from their representatives, 
they can judge their actions without being implicated 
in them themselves.° 

Two conceptions of accountability are consistent 
with the principles of the delegation model. The 
first, borrowed from economic models of employer/ 


5 See The Port Huron Statement of the Students for a Democratic 
Socety (1962) and the Australian Council of Trade Unions (2002) 
background paper. 

6 Hamilton, Madison, and Jay (1787) 1982, nos. 10 and 49. See Grant 
and Grant 1981, 35-36. 
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employee relations, is the principal-agent model (Ma) 
(Pollack 1997). Political officials (including elected 
representatives) are understood to be “employees” 
(“agents”) of their constituents or their executive su- 
periors (“principals”). In principal—agent models, the 
principal’s preferences are taken as given, and the re- 
lationship is judged a failure if the agent deviates from 
them. In this understanding, accountability is ensured 
when agents have incentives to do what the principals 
want them to do. Agents are seen as instruments of the 
principal’s will, and they ought to act as the principal 
himself would if he held the position (Maravall 1999, 
155). 

Yet the relation of officeholder/public differs in im- 
portant respects from the employee/employer relation. 
First, once appointed or elected, officeholders make 
decisions that govern members of the public; employ- 
ees do not make rules to which their employers are 
subject. The power relations in the case of politics are 
therefore more complex (Rousseau [1762] 1988, I.6). 
Second, unlike most employees, when a politician is 
given a job, he or she is generally given a great deal 
of power and considerable discretion as to how to use 
it. Often, officeholders are entrusted with power on 
account of their superior expertise or judgment. They 
ought to exercise their discretion in performing their 
duties.’ Principal_agent models are inappropriate, for 
example, when calling judges to account. Power is cer- 
tainly delegated to judges, but they are not responsible 
for enacting the will of those who empower them; they 
have different sorts of official duties. In fact, one might 
say that among their duties is the duty to resist enact- 
ing the will of those who empowered them when to 
do so would bend or violate the law. In general, the 
principal-agent model sharply restricts the scope for 
leadership. It does not allow that the role of a leader 
could include defying the transitory popular will or 
forming people’s preferences in the direction of their 
true interests, obligations, or long-term good. 

The trustee model of delegation (Ib), in contrast, 
presupposes that officials will use discretion. Hence, 
the implicit standard for abuse of power differs from 
that implied by the principal-agent model. Deviations 
of the agent’s actions from the principal’s desires would 
not necessarily constitute abuse of power. A repre- 
sentative or officeholder could defend an unpopular 
exercise of power as legitimate by showing that it both 
was within the officer’s jurisdiction and actually served 
the purposes for which he or she was authorized to 
act (Burke [1774] 1949). Accountability mechanisms 
consistent with this model are designed to ensure that 
officials exercise their discretion, which might be con- 
siderable, in accord with their official duties. 

The two basic models, participation and delega- 
tion, lead to different strategies and mechanisms for 
accountability, because they are grounded in differ- 
ent notions of legitimacy. In the participation model, 
those affected hold power-wielders accountable di- 
rectly through participation, whereas in the delegation 


7 For an mteresting discussion of “two logics of delegation,” agency 
and fiduciary, see Majone 2001 
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model, those delegating power hold power-wielders 
accountable through a variety of mechanisms for judg- 
ment after the fact. 

The two basic models of accountability can also lead 
to conflicting political judgments. For example, the IMF 
and the World Bank were created by governments, 
which delegated their powers to them, provide funding, 
and ensure creditworthiness. In their weighted voting 
systems, the major financial powers control the execu- 
tive boards of these organizations. When senior officials 
of the World Bank, such as former chief economist 
Joseph Stiglitz, severely antagonize the U.S. Govern- 
ment, they are forced to resign.® This is as it should be 
where the delegation model of accountability operates. 
Yet at the same time, the World Bank (2000) acknowl- 
edges the importance of “empowerment” of poor peo- 
ple in order to increase the “accountability and respon- 
siveness” of public sectors to them (9). The language 
of empowerment suggests a participatory model of ac- 
countability, the logic of which could easily be extended 
to imply more empowerment within the Bank itself for 
the people who are affected by its policies, whether 
they are represented through state leaders or NGOs. 

There is a clear tension between the concept of a 
World Bank that is accountable to poor people and 
one that is accountable to the U.S. Secretary of the 
Treasury. Similarly, the IMF might be considered ac- 
countable to those whose money it 1s lending to take 
only reasonable risks, which leads to a policy of re- 
quiring structural adjustments. But it is also called 
to account for the effects of those structural adjust- 
ments within the countries accepting the conditions of 
IMF loans. The actual patterns of accountability facing 
the IMF and the World Bank combine practices justi- 
fied on the basis of both delegation and participation 
models: practices which are often in tension with one 
another. 

In liberal democratic states, practices based on these 
alternative models are often fused, and there is no ten- 
sion between them. For instance, democratic elections 
are examples of both accountability through partici- 
pation and accountability through judging the perfor- 
mance of one’s delegates. Thus the process of demo- 
cratic elections can be endorsed readily as an effec- 
tive mechanism for accountability by anyone hold- 
ing any one of the four views articulated here. This 
convergence, however convenient in the context of 
liberal democracies, does not apply in contemporary 
world politics, since democratic elections are not viable 
mechanisms for accountability at the global level. Fail- 
ure to distinguish the theoretical principles involved 
in these models has therefore become a source of 
confusion for the analysis of accountability in world 
politics. Maintaining the analytical distinctions among 
them should help to determine the extent to which 
domestic democratic accountability mechanisms are 
applicable to the problems of accountability in global 
politics. 


8 See Financial Tunes, June 16, 2000, and Stiglitz 2002. 
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IMPLICATIONS FOR ACCOUNTABILITY 
IN GLOBAL POLITICS 


One way in which the delegation—participation distinc- 
tion can help to clarify our thinking is indicated when 
we recognize that many power-wielders in world pol- 
itics acquired their power without delegation. States, 
subunits of states, multinational firms, and NGOs were 
all created either without any act of authorization at 
all or without having been authorized to act by any set 
of entities even remotely representative of the world 
population as a whole. Only international organiza- 
tions had their authority to act in the international 
arena explicitly conferred on them—in this case by 
states. Thinking that accountability is only relevant 
when authority has been delegated can lead observers 
to overlook the accountability deficits of those power- 
wielders, such as states, multinational firms, and NGOs, 
that do not depend for their existence on delegation. 

On the other hand, thinking only about participa- 
tion could lead to different forms of confusion. If one 
conceives of accountability solely in terms of its partic- 
ipatory democratic forms, one is apt to despair of the 
prospects for effective accountability in world politics. 
Since world democracy does not seem feasible, some 
commentators have seen a stark choice: between sacri- 
ficing the benefits of international cooperation for the 
sake of democracy, on the one hand, and accepting rule 
by nondemocratic and unaccountable international or- 
ganizations, on the other (Dahl 1999). Other commen- 
tators, less pessimistically, hold to participatory demo- 
cratic forms of accountability as the ideal, treating the 
problems of global accountability as directly analogous 
to domestic accountability (Falk and Strauss 2000). 

But comprehensive programs to institute democratic 
accountability at the global level on the basis of an anal- 
ogy with domestic democracy founder on the absence 
of a coherent and well-defined global public. At the 
domestic level, those affected by the power-wielders 
are generally understood to be those subject to the 
laws of the polity, so it is fairly easy to determine who 
has a right to participate. But having the right to par- 
ticipate in politics as an affected party is ambiguous 
at the global level. For example, if being affected by a 
decision were sufficient to support a right to participate 
in decision-making, anyone who buys gasoline would 
be entitled to participate in OPEC’s deliberations, and 
anyone affected by world interest rates to participate 
in discussions at the Federal Reserve or the European 
Central Bank. In the absence of a public whose bound- 
aries are defined by participation in a polity, it is very 
difficult to specify either who should be entitled to par- 
ticipate or how they would do so. 

Today, there is no large and representative global 
public, even in the relatively weak sense of a global 
“imagined community” (Anderson 1991)—a transna- 
tional community of people who share a sense of com- 
mon destiny and are in the habit of communicating with 
one another about issues of public policy. Particular 
global publics are indeed emerging—for instance, in 
issue-areas such as human rights and environmental 
protection (Keck and Sikkink 1998)—but they surely 
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are not representative of the world’s people, and they 
are by no means coterminous with the sets of people 
affected by the policies of states, multinational firms, 
or multilateral organizations. 

In order for a global public to function politically, 
there would need to be some political structure that 
would help to define who was entitled to participate, 
and on what issues. In addition, many more people 
would have to identify transnationally and be willing to 
participate as members of a global public.” The number 
of participants would have to be sufficiently large and 
representative, and the means of participating suffi- 
ciently open, that the views of the active public could 
be seen as reflective of the opinions of people in the 
world as a whole to a significant extent. Whether such 
conditions could ever be met or whether the problems 
of scale render global democracy impossible or even 
undesirable will continue to be highly contested ques- 
tions. What we can say with confidence is that, today, 
while there are fragmentary global publics, a genuine 
global public comparable to publics in well-established 
democracies does not exist. 

Another way of making this point would be to say 
that world politics today lacks a public in two distinct 
senses. There is no juridical public on a global level, 
since no legal institutions define a public with authority 
to act globally. There is no sociological global public, 
because only a very small minority of people in the 
world identify and communicate with other people on 
a global basis, or even follow world events very closely. 
This analysis suggests that proposals for global partici- 
patory institutions, such as that of Richard A. Falk and 
Andrew Strauss (2000), are premature at best. There is 
no global demos, and “if there is no demos, there can 
be no democracy” (Weiler 1999, 337).!° 

In democratic nations, the existence of a clearly de- 
fined public provides the responses to the fundamental 
questions about accountability: (1) What constitutes 
an abuse and why? and (2) Who is entitled to hold 
power-wielders accountable and why? Power is abused 
whenever it is used for private or partial interests con- 
trary to the interests of the public. And the public is 
entitled to hold power-wielders accountable in a demo- 
cratic nation for abuses of power, either as the source 
of that power (the delegation model) or as the body 
affected by it (the participation model) or both. On 
the global level, there is no public that can function 
in this way—to provide answers to the fundamental 
questions about accountability and to ground the justi- 
fication for accountability mechanisms of a democratic 


9 The World Values Survey, taken in 70 countries at two different 
periods of the 1990s, indicated that only 15% of respondents viewed 
themselves as identifying primanly with their continental regions or 
the world as a whole Forty-seven percent had principally local and 
regional attachments, while 38% listed their nation-state as their 
principal affihation (Norris 2000, 161-66) 

° David Held (1995, 232) has notably made the case for “cosmopoli- 
tan democracy” as an ideal, in which associations of democratic states 
would lead to people learning “the theoretical lesson that democratic 
legitumacy can ultimately only be redeemed transnationally” We 
are more sympathetic with Held’s argument for an eventual cos- 
mopolitan democracy than with the institutional proposals of Falk 
and Strauss. 
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type. This is the crucial difference between problems 
of accountability globally and domestically. There is 
no simple analogy that can be made between domestic 
democratic politics and global politics. Consequently, 
we must seek elsewhere for answers to these two fun- 
damental questions if there is to be coherently justified 
and practically effective accountability at the global 
level. 

Effective accountability at the global level will re- 
quire new, pragmatic approaches: approaches that do 
not depend on the existence of a clearly defined global 
public. Attention will need to be paid to delegation 
problems: exercising control over agents to whom im- 
portant tasks have been assigned. And the participa- 
tion issue will have to be addressed as well. “Those af- 
fected” might be defined in some cases as those subject 
to a rule or regulation promulgated by a transnational 
organization. In other cases, it might mean publics in 
a particular region. Improved mechanisms for partic- 
ipation could range from processes akin to “notice 
and comment” provisions in administrative law to the 
increased transparency necessary for public scrutiny 
in the media. Increased domestic democracy can be 
an important form of participation in global politics 
in cases where states are the primary actors in inter- 
national organizations or where global policies must 
be implemented by state action. Of course, domestic 
democracy can improve accountability to citizens and, 
at the same time, work against the interests of people 
affected by government policies beyond state borders. 
Indeed, there is a danger that gaps between rich and 
poor countries could be widened by relying on do- 
mestic democratic accountability, because, in general, 
rich countries are more democratic than poor ones. 
Different attempts at increased participation will have 
different advantages and disadvantages. It is well to 
remember also that there are alternative forms of ac- 
countability as well. The International Criminal Court, 
for example, is an accountability mechanism without 
any participatory element. 

Before discussing how specific accountability mecha- 
nisms might function with respect to each of the various 
types of global powers, however, we must address the 
two questions that we have identified as central to any 
theory of accountability: What constitutes an abuse of 
power? and Who is entitled to hold power-wielders 
accountable? But now we need to address them with 
respect to world politics in particular. We need to know 
how standards of legitimacy can be established in world 
politics and what mechanisms are available for empow- 
ering some actors to hold others accountable to those 
standards. 


STANDARDS OF LEGITIMACY IN WORLD 
POLITICS 


World politics is characterized by sharp conflicts of 
interests and values, and by potential or real violent 
conflict in the absence of a common government. In- 
clusive international organizations—which can make 
claims for legitimacy on that basis—are often the most 
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divided for that very reason. Furthermore, many of the 
chief sources of legitimacy at the domestic level, such 
as constitutional mandates, electoral processes, legality, 
and tradition, are not available to transnational orga- 
nizations or to states when they exert coercive effects 
on noncitizens. Hence claims to legitimacy at the global 
level depend on inclusiveness of state participation and 
on general norms of fairness, which have been increas- 
ingly affected by democratic values. 

The standards of legitimacy against which power- 
wielders can be held accountable derive primarily from 
three sets of informal norms. First, legitimacy derives 
from conformity to human rights norms that are widely 
shared by the elites and publics of the most powerful 
states in the global political system and that are main- 
tained by a public discourse. For example, the Kosovo 
Report declared the NATO war against Serbia in 1999 
to be legitimate, because it was designed to protect an 
innocent population, even though it was technically il- 
legal according to the United Nations Charter. The “re- 
sponsibility to protect” innocent populations trumped 
the norm of state sovereignty, even as expressed in the 
Charter.'! Second, there has been increased agreement 
that many normative principles inherent in democracy 
are applicable at the global level (Sen 1999), such as 
the right to exercise voice about domestic governance. 
Third, there has been intense normative pressure on 
the patterns of extreme economic inequality that per- 
meate the contemporary global political economy. 

It follows that power-wielders may be held account- 
able to standards of conduct articulated in transna- 
tional civil society, even though there is no applicable 
international law and even though their power does not 
derive from authority delegated to them. When Nike 
was criticized in the late 1990s for hiring contractors 
using “sweatshop” labor, it had not necessarily broken 
any laws. Activists complained, not of legal violations, 
but of the inequity of workers in the Dominican Repub- 
lic or Vietnam being employed under miserable con- 
ditions, and paid badly, to produce luxury goods that 
would be sold for many times their production costs 
in rich countries. Although some of the human rights 
norms used to hold former leaders accountable in Latin 
America are legalized, they operated also through non- 
legalized social processes involving elites’ desires for 
legitimacy and esteem (Lutz and Sikkink 2000, 658- 
9). When standards are not legalized, we would expect 
accountability to operate chiefly through reputation 
and peer pressure, rather than in more formal ways. 

Another source of legitimacy is law (Franck 1990, 
_ 1995). In traditional international law, those entities 
that are neither states nor international organizat- 
ions—such as firms‘and private associations—are sub- 
ject to the laws of the states possessing jurisdiction. 
But states, and in’ some instances non-state actors, 
also can be held to standards articulated in inter- 
national law. Human rights treaties and the statute of 


1 See Independent International Commission on Kosovo 2000 and 
International Commission on Intervention and State Sovereignty 
2001. 
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the International Criminal Court constitute obvious 
examples: states and their leaders can be held account- 
able for violations of their provisions. The WTO issues 
Tulings binding on states, which must often, in order 
to comply, enact or alter domestic laws or regulations 
applying to firms and their transactions. Even if no 
treaty provisions cover a situation, international law 
includes a category of jus cogens—peremptory norms, 
such as norms against piracy, whose violation justifies 
legal accountability. Finally, customary international 
law reflects state practice “accompanied by the convic- 
tion that it reflects a legal obligation,” known as opinio 
juris (Malanczuk 1997, 44). 

For international organizations, to whom authority is 
formally delegated, legitimacy depends on some com- 
bination of conformity to shared norms and to estab- 
lished law. The predominant legal instruments are char- 
ters or articles of agreement that specify the procedures 
by which they have to act to make their rulings author- 
itative, defining what is often referred to as “input” or 
“process” legitimacy. Equally important, however, the 
charters of international organizations typically spec- 
ify, in general normative terms, their purposes. The 
United Nation was formed “to save succeeding gen- 
erations from the scourge of war,” as well as to pro- 
mote fundamental human rights, create the conditions 
for justice, and promote social progress (UN Char- 
ter, Preamble). These purposes change over time. The 
World Bank was designed in 1946 to assist in postwar 
reconstruction, promote private foreign investment, 
and promote the growth of international trade; only a 
subordinate clause in its Articles of Agreement refers 
to “the encouragement of the development of produc- 
tive facilities and resources in less developed coun- 
tries.” Its Web-site now declares that “today’s Bank 
has sharpened its focus on poverty reduction as the 
overarching goal of all its work.” 1? 

As a practical matter, then, even in the absence of 
a global public comparable to domestic publics, stan- 
dards have begun to emerge in answer to the question, 
“What constitutes an abuse of power?” 


SEVEN ACCOUNTABILITY MECHANISMS 


Our second question, “Who is entitled to hold power- 
wielders accountable for abuses?” receives a variety of 
answers in the practice of global politics. We have iden- 
tified seven discrete accountability mechanisms that 
actually operate in world politics on the basis of which 
improved practices of accountability could be built. 
These accountability mechanisms are summarized in 
Table 2.° Some operate most effectively when stan- 
dards of legitimacy are formally encoded in law; others 


2 International Bank for Reconstruction and Development, 
Articles of Agreement, Article 1. http/Aveb.worldbank.org/ 
WBSITE/EXTERNAL/EXTABOUTUS/EXTARCHIVES  (visi- 
ted September 30, 2003) 

D For earlier discussions of accountability mechanisms, (see 
Keohane 2003 and Keohane and Nye 2003) The former article 
contams a table that 1s very srmular to our Table 2, although Table 2 
omits electoral accountability, as not relevant to contemporary global 
mstitutions, 
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TABLE 2. Seven Mechanisms of Accountabll 


Mechanism 
Hierarchical! 


Supervisory 


Accountabllity Holder 
Leaders of 


organization 
States 


Funding agencies 


Courts 


Equity- and 


Power-Wilelder 
Subordinate official 


Multllateral 
organizaton and 


its executive head 


Funded agency 


Indrvidual official or 
agency 


Firm or government 


In World Politics 
Cost to Power-Wilelder 


Loss of career 
opportunities 
Restraints on ability 


February 2005 


Example 
Authority of UN 
Secretary-General 
World Bank and IMF 


to act, loss of office governance by 


Budget restnctions 


From restriction of 
authonty to 
criminal penalties 

Loss of access to, or 


thelr executive 


boards 
Withholding of UN 

dues 
International 

Criminal Court 


Refusal of capital 


bond-holders and 
consumers 


Peer organizations 


Public reputational’ Peers and diffuse 


public 


Organizations and 
thelr leaders 


Individual or agency 


markets to finance 
developing country 
governments 
during world 
financial crises 
Independent marine 
certification body’s 
evaluation of the 
Greenpeace-Shell 
controversy 
Effects on U.S. “soft 
power” of 
unllaterallsm 


higher cost of. 
capital 


Effects on network 
ttes and therefore 
on others’ support 


2 Reputatonal effects are invotved n all sues of accountability, as mechanisms leading to punishment through hierarchy, supervision, 
fiscal measures, legal action, the markst, and peer responses. The category of public reputational effects refers to situations where the 
other means of accountability are not available, but reputatonal effects are widely known and significant. 


enforce less formal norms. Four of these mechanisms 
rely heavily on delegation: hierarchical, supervisory, 
fiscal, and legal accountability. The remaining three— 
market, peer, and reputational accountability—involve 
forms of participation, although the participants in 
each of these forms of accountability are different. It 
should be noted that these categories should not be 
too rigidly applied: Legal accountability, for example, 
involves a participatory element in any legal system 
that allows citizens to sue powerful entities for failures 
of responsibility. 

Hierarchical accountability is a characteristic of bu- 
reaucracies and of virtually any large organization. Su- 
periors can remove subordinates from office, constrain 
their tasks and room for discretion, and adjust their 
financial compensation. Hierarchical accountability as 
we use the term applies to relationships within orga- 
nizations, including multilateral organizations such as 
the United Nations or the World Bank. 

Supervisory accountability refers to relations be- 
tween organizations where one organization acts as 
principal with respect to specified agents. For instance, 
the World Bank and IMF are subject to supervision by 
states and by institutions within states, such as courts. 
These supervisory relationships are more or less demo- 
cratic as states are more or less democratic. Indeed, 
courts in democracies could demand that international 
organizations such as the IMF and World Bank follow 
procedures that meet due process standards of inter- 
national law (Stewart 2003, p. 459). Advocacy groups 
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within states can put pressure on such organizations as 
the IMF and World Bank through domestic political 
institutions, as environmentalists did with respect to 
World Bank loans for dam projects in the 1980s (Fox 
and Brown 1998). Firms also hold state agencies ac- 
countable, through the political process, for their poli- 
cies in international organizations, such as the WTO, 
whose decisions are relevant to the firms’ interests. 

Fiscal accountability describes mechanisms through 
which funding agencies can demand reports from, and 
ultimately sanction, agencies that are recipients of 
funding. This form of accountability was fundamen- 
tal to the emergence of parliamentary power in Eng- 
land during the seventeenth century and is particularly 
important for international organizations such as the 
United Nations and the World Bank, which rely on 
government appropriations to fund substantial parts 
of their activities. 

Legal accountability refers to the requirement that 
agents abide by formal rules and be prepared to jus- 
tify their actions in those terms, in courts or quasi- 
judicial arenas. Public officials, like anyone else, can 
be “held accountable” for their actions both through 
administrative and criminal law. Courts do not have 
the broad general authority of governments or of elec- 
torates in democracies. Instead, the courts apply a 
narrow version of the trusteeship model (IIb), asking 
whether the power-wielders performed the duties of 
their offices faithfully in a limited sense: whether they 
obeyed the law. Legal accountability has long been 
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important in constitutional democracies and has be- 
come increasingly important in world politics during 
recent years (Goldstein et al. 2001). The WTO Dispute 
Settlement Mechanism, the operations of the Hague 
Tribunal on the Former Yugoslavia, and the creation 
of a new International Criminal Court all illustrate the 
incursions that conceptions of legal accountability have 
made in world politics. 

Market accountability is a less familiar category, but 
an important one. We want to emphasize that this form 
of accountability is not to an abstract force called “the 
market,” but to investors and consumers, whose in- 
fluence is exercised in whole or in part through mar- 
kets. Investors may stop investing in countries whose 
policies they dislike or, at least, demand higher rates 
of interest (Mosley 2003). Consumers may refuse to 
buy products from companies with bad reputations for 
labor standards or other practices, as well as from com- 
panies with inferior or costly products. 

Peer accountability arises as the result of mutual eval- 
uation of organizations by their counterparts. NGOs, 
for example, evaluate the quality of information they 
receive from other NGOs and the ease of cooperating 
with them. Organizations that are poorly rated by their 
peers are likely to have difficulty in persuading them 
to cooperate and, therefore, to have trouble achieving 
their own p 

Public reputational accountability is pervasive be- 
cause reputation is involved in all the other forms of 
accountability. Superiors, supervisory boards, courts, 
fiscal watchdogs, markets, and peers all take the repu- 
tations of agents into account. Indeed, reputation is a 
form of “soft power,” defined as “the ability to shape 
the preferences of others” (Nye 2004, 5). The category 
of public reputational accountability is meant to apply 
to situations in which reputation, widely and publicly 
known, provides a mechanism for accountability even 
in the absence of other mechanisms as well as in con- 
junction with them. 

There are also processes that do not meet the stan- 
dards for accountability but that serve to constrain 
power. As in a system of checks and balances, over- 
lapping jurisdictions or interest areas may require ac- 
tors to compromise with one another to secure the 
cooperation necessary to define or implement policy. 
Such a situation is characterized by negotiation con- 
straints. Since our ultimate goal is to provide greater re- 
straints on the abuse of power, negotiation constraints 
are properly included in this analysis, though they are 
not accountability mechanisms and, therefore, are not 
listed in Table 2. 


ACCOUNTABILITY CONSTRAINTS IN 
WORLD POLITICS 


How do these mechanisms constrain the major power- 
wielders in world politics? The most accountable 
of these power-wielders are probably those toward 
which, ironically, the most accountability-criticism is 
directed: multilateral organizations such as the WTO, 
IMF, World Bank, European Commission, and United 
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Nations. State officials closely supervise all of these 
entities. Indeed, these organizations are constrained 
from the start since the legitimacy of their actions is 
often judged simultaneously by three sets of potentially 
conflicting standards: whether they serve the interests 
of their member states, the purposes for which they 
were established, and evolving standards of benefits 
and harms. 

The criticisms of these organizations on grounds 
of accountability refers not to lack of accountability 
per se, but to weaknesses in democratic accountability 
(Dahl 1999). Even then, often the version of demo- 
cratic accountability that is contrasted with actual insti- 
tutional practice is a highly idealized one. As Andrew 
Moravesik (2002) argues with respect to the European 
Union: 

Most critics compare the EU to an ideal plebiscitary or 

parliamentary democracy, standing alone, rather than to 

the actual functioning of national democracies adjusted 
for its multi-level context. When we conduct the latter 
sort of analysis, we see that EU decision-making proce- 
dures, including those that insulate or delegate certain 
decisions, are very much in line with the general prac- 
tice of most modern democracies ın carrying out similar 
functions. (621-22) 


If we focus not solely on democratic accountability, 
but on accountability more broadly, we can see that 
supervisory accountability, with states as the account- 
ability holders, is quite strong for multilateral organi- 
zations. For example, the World Bank has both an ex- 
ecutive board, composed of state representatives, that 
makes policy, and an inspection panel that is designed 
to ensure that its policies are actually followed within 
the organization. Fiscal accountability is also strong, 
since multilateral organizations depend on subven- 
tions from states. Both of these forms of accountability, 
which follow the delegation model, have been strength- 
ened in recent years (Nielson and Tierney 2003). The 
Bank’s uses of participatory forms of accountability are 
less formalized, but one can see evidence of participa- 
tory accountability in the Bank’s efforts to enter into a 
dialogue with NGOs (implying a form of peer account- 
ability) and in its efforts, clearly evident on its web site, 
to justify its policies to a broader public (manifesting 
reputational accountability). 

Multilateral institutions are, indeed, highly con- 
strained by accountability mechanisms.'* The problem 
is not a lack of accountability as much as the fact that 
the principal lines of accountability run to powerful 
states, whose policies are at odds with those of their 
critics, and which may or may not themselves be fully 
democratic. Publics within countries are not heavily 
involved in these processes. 

NGOs are often seen as the entities to which power- 
wielders such as the World Bank should be held ac- 
countable. In this view, they are the “virtual represen- 
tatives” of publics adversely affected by other global 


14 This is not to say that accountability is perfect by any means (Fox 
and Brown 1998; Woods 1998, 2001). 
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power-wielders. Increasingly, NGOs are formally rep- 
resented at international meetings, often with specific 
rights and privileges. However, international NGOs are 
not legitimated by ties to a defined public. In practice, 
few NGOs have well-defined procedures for account- 
ability to anyone other than financial contributors and 
members—quite a small set of people. They can iden- 
tify elements of what they consider to be “the public 
interest,” but what constitutes the public interest is 
contested. The fact that NGOs are typically the result 
of entrepreneurial initiatives by activists means that 
they do not result from a process of delegation. As 
a result, there is no organization that endowed them 
with powers formally responsible for holding them to 
account. 

Even NGOs whose members may see themselves 
as pure-spirited guardians of the public interest can 
abuse their power—which for NGOs is usually lim- 
ited to their ability to provide information and mold 
public opinion. Greenpeace, for instance, demanded in 
1995 that Shell not dispose of its Brent Spar drilling 
rig in the North Sea, alleging that it contained 5000 
tons of oil, vastly more than Shell estimated. After a 
huge media campaign, very costly to Shell’s reputation, 
Greenpeace had to admit that it had greatly overesti- 
mated the amount of oil on board. Eventually, an inde- 
pendent marine certification body broadly confirmed 
Shell’s original estimates (BBC 1998). 

NGOs are fiscally accountable to their major contrib- 
utors, who can demand reports. Insofar as they depend 
on a small number of donors who monitor their behav- 
ior, these donors may be seen as delegating authority 
to them through their dollars and as capable of holding 
them to account by withdrawing their support. Only 
in this very limited sense is the concept of delegation 
operable with respect to NGOs. NGOs that appeal to 
a large public are largely subject to participatory forms 
of accountability. Although the public pays attention 
only intermittently, NGOs have to maintain their rep- 
utations, which may imply being concerned also about 
peer accountability. Furthermore, they operate in a 
“market for donors,” so are in that sense subject to mar- 
ket accountability. The judgment of the independent 
marine certification body, about Greenpeace’s claims, 
is a good example of peer accountability. Too many 
false alarms lead such NGOs to be ignored or scorned 
by other NGOs with which they compete, by the press, 
and by individuals in various societies, on which they 
depend for contributions and political support. In gen- 
eral, the weakness of NGOs—their dependence on 
reputation and funding and their lack of coercive force 
or huge material resources—makes the lack of formal 
accountability mechanisms for them less likely to lead 
to serious abuses of power than is the case for states. 
However, as NGOs become stronger, with credibility 
that is not easily shaken even as they make false or 
prejudicial claims, their lack of such mechanisms of 
accountability, apart from legal provisions within states 
against fraud, becomes a more serious issue. 


15 See “The Democratic Accountability of Non-Governmental Or- 
ganizations,” symposium proceedings published in the Chicago Jour- 
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Firms, including media firms, may be limited in some 
ways by their own stockholders, but there is little reason 
to believe that stockholders will object to behavior that 
benefits the firm at the expense of outsiders. Manage- 
ment wields power delegated from stockholders, but 
since stockholders share certain narrow institutional 
interests (for instance, in profits), they do not substitute 
for effective external accountability. Firms are subject 
to domestic supervisory and legal accountability, since 
they are chartered by states. But at the global level they 
are only subject to broad participatory accountabul- 
ity, through reputational accountability and in related 
ways, through the market. Firms with brand names to 
protect may be quite vulnerable to consumer boycotts, 
as the movement against sweatshops, focused for a time 
on Nike, demonstrated. Media firms are particularly 
subject to reputational accountability. Commenting on 
the resignations of the two top editors of The New York 
Times in June 2003, a commentator observed, “They, 
of course, had to resign. Any company has to sell the 
credibility of its product, but a media company has 
nothing else to sell.”!© Firms whose reputations suffer 
because of their real or alleged behavior are likely also 
to suffer in the marketplace. They may therefore have 
incentives to adopt codes of conduct in order to protect 
both their reputations and their market share (Garcia- 
Johnson 2000). 

A recent systematic analysis of accountability by the 
One World Trust (2003) rates various multilateral or- 
ganizations, multinational firms, and NGOs on eight 
criteria, four relating to “internal stakeholder account- 
ability” and four to “external stakeholder accountabil- 
ity.” The internal dimensions include member control, 
procedures for appointment of senior staff, compliance 
mechanisms, and evaluation processes; the external 
dimensions include external stakeholder consultation, 
complaint mechanisms, corporate social responsibility, 
and access to information. Seven NGOs, six firms, and 
five multilateral organizations were evaluated, using 
the same standards, In the summary graphs provided 
in the report, scores for member control and access to 
information were combined. Interestingly, there were 
no significant differences in composite accountabil- 
ity ratings among these three types of organizations. 
Organizations of all three types ranked toward the top, 
and toward the bottom, of the list.” 

In many issue-areas, harmonized policies are worked 
out among functionally defined umts of different gov- 
ernments, such as securities regulators, central bankers, 
and antiterrorist units (Keohane and Nye 1974; 
Reinecke 1999-2000; Slaughter 2004). These transgov- 
ernmental networks do not provide mechanisms for 
either delegated or participatory accountability. Since 
these networks are informal, it is often unclear which 


nal of International Law, Vol 3, No. 1 (Spnng 2002), with contribu- 
tions by Paul Wapner, Peter J. Spro, Debora Spar and James Dail, 
and Benedict Kingsbury 

16 The quote 1s from Warren L. Batts, Adjunct Professor at the Uni- 
versity of Chicago School of Business (The New York Tunes, June 6, 
2003, p. A 28) 

17 The average ranking for all three types of organization was be- 
tween 9 and 10. 
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organizations have delegated powers to them. Further- 
more, participatory accountability is minimal: The gen- 
eral public is not involved, and transparency is typically 
lacking. Abuses of power might in some instances be 
controlled by the fragmentation of power and conflicts 
of interest between the participants, but cooperation 
among the members of such networks could easily be- 
come collusion against the interests of outsiders. 

In our terms, there is some peer accountability 
within transgovernmental networks, since the entities 
involved may demand information from one another 
and sanction other entities for perceived misbehavior. 
But there are no clear mechanisms of accountability, 
because accountability requires that there is a pub- 
lic standard of legitimacy to which political actors are 
held. On the other hand, there is the potential here for 
negotiation constraints. The power of an entity in the 
network may be checked insofar as abuses are against 
the interests or principles of the other entities within 
the transgovernmental networks. Diversity among the 
parties is a precondition for constraint; without it, col- 
lusion is the more likely possibility. 

Finally, what about constraining abuses of power by 
states? States vary a great deal in their susceptibility to 
accountability mechanisms, so we consider separately 
three categories of states: (1) weak, dependent states, 
such as those poor countries that rely heavily on foreign 
aid for budget support; (2) independent states that are 
not great powers; and (3) preat powers, particularly 
hegemonic states. With respect to the latter two cate- 
gories, it is important to distinguish between autocra- 
cies and democracies. 

Weak and dependent states may be subject to fiscal 
and supervisory accountability, often through inter- 
national organizations such as the IMF and the World 
Bank or, in cases of state breakdown (as in parts of 
the former Yugoslavia), the United Nations. Typically, 
states delegate extensive authority to international or- 
ganizations to supervise their behavior only when they 
find themselves in great difficulty. Even weak states 
often succeed at evading the mandates of the World 
Bank and IMF, but the overall structure of power guar- 
antees a substantial measure of accountability.'8 

For states that are fiscally and legally independent 
but not great powers, much depends on their inter- 
nal regimes as well as the extent to which they are 
enmeshed in networks of interdependence. It was ex- 
traordinarily difficult even for the great powers, act- 
ing through the UN Security Council, to hold Saddam 
Hussein’s Iraq accountable for its weapons of mass de- 
struction, and in 1998 Iraq expelled the UN inspectors 
from the country. Self-financing autocracies that do not 
depend much on the rest of the world, such as North 
Korea and Myanmar, or that can generate sources of 
funds with which to purchase goods illegally, such as 
Iraq under Saddam Hussein, are very hard to hold 
accountable. 

For democratic states involved in multiple relation- 
ships of interdependence, control of abuses of power 


18 For a discussion of accountability on environmental issues, see Fox 
and Brown 1998 
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is easier. Negotiation constraints are most important 
for these states, since even quite powerful states need 
cooperation from others, even in the absence of insti- 
tutionalized accountability mechanisms (Keohane and 
Nye 2001). The more they are enmeshed in networks 
of interdependence, the more they are likely to try to 
avoid being out of step with most of their peers, as the 
example of members of the European Union certainly 
illustrates (Moravesik 1998). However, the more pow- 
erful the state, the less constraining may be negotiation 
constraints on any individual issue. 

The most complex issues arise with respect to very 
powerful states with constitutional democratic govern- 
ments, such as the United States. Such governments 
are accountable to their citizens and to an array of 
domestic interests and institutions, but as we have 
noted, this does not assure accountability to outsiders. 
Large and powerful states do not depend on subven- 
tions from others or on markets, and there is no strong 
international legal structure governing their actions, 
despite the new International Criminal Court. Indeed, 
as recent U.S. behavior shows, such states often resist 
international legal accountability. The only forms of 
external accountability to which they are consistently 
subject, across a range of issue areas, are peer account- 
ability and reputational accountability. These attempts 
at accountability, however, depend on efforts, often 
ad hoc, to establish a basis of legitimacy on which 
to hold a state accountable. In the debates over war 
with Iraq in the winter and spring of 2003, for ex- 
ample, France, Germany, and Russia engaged in “soft 
balancing”—seeking to constrain the United States by 
denying legitimacy to its attack on Iraq. However, as 
this episode shows, it is difficult to impose peer or rep- 
utational accountability without a firm consensus on 
what constitutes legitimate behavior. If the powerful 
state controls substantial material resources, includ- 
ing force, and if it has strong internal legitimacy—so 
that its public does not react negatively if its leaders 
are criticized abroad—it may be largely immune from 
sanctions, as the United States was in 2003.19 

The major exception to the generalization that pow- 
erful states do not delegate powers of supervisory ac- 
countability over their own actions is the WTO. Ironi- 
cally for those who attack the WTO as unaccountable, 
it represents one of the few attempts in world politics 
to formalize legal and supervisory accountability over 
the actions of sovereign states. 

This discussion may clarify the role of transparency in 
accountability. The availability of information is crucial 
for all forms of accountability, but transparency, or the 
widespread availability of information, is essential to 
market, peer, and reputational accountability, as well as 
to the internal workings of democratic accountability in 
states which play a supervisory role. Public reputational 
accountability, for example, relies almost exclusively 
on transparency. But transparency is not sufficient for 
effectiveness. Without standards and sanctions—and 


19 For some suggestions designed to improve accountability on is- 
sues of military intervention coming before the United Nations, see 
Buchanan and Keohane 2004. 
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TABLE 3. Power Wlelders and Constraints 


Transgovernmental networks 


States 


a configuration of power that enables sanctions to 
be imposed relatively consistently on all violators of 
standards—accountability that is both effective and 
widely viewed as legitimate will remain elusive.” 

The above discussion is summarized in Table 3, which 
lists the five types of power-wielders that we have dis- 
cussed and the constraints that are likely to be most 
significant for each. Two interesting conclusions can be 
drawn from Table 3. First, multilateral organizations 
and democratic states are the only types of organization 
in world politics consistently subjected to delegated as 
well as participatory accountability. Second, as a result, 
multilateral institutions are in general more account- 
able than NGOs, firms, transgovernmental networks, 
and nondemocratic states—not less accountable, as a 
casual reading of the critical democratic literature on 
accountability might suggest. 

Of course, the appropriateness and efficacy of any 
of our mechanisms for accountability will depend on 
the particular context. For example, firms that depend 
on brand-name products can be expected to be more 
constrained by reputational accountability than other 
firms. Moreover, with respect to states, the actual char- 
acter of accountability will depend as much on internal 
institutions, especially those of democracy, as on ac- 
countability mechanisms at the global level. The fact 
that we have emphasized the latter should not be in- 
terpreted as implying that we downgrade the signi- 
ficance of internal democracy. Elections are the prin- 
cipal means by which ordinary individuals can have 
an impact on policy. The fact that global democracy is 
infeasible does not mean that domestic democracy is 


20 We are therefore less optimistic than Ann Florin: (2003), who 


argues that “transparency is providing new opportunities both to en- 
force rules and standards and to hold accountable those who purport 
to act in the public mterest” (196). 
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Relevant International and 
Transnational Accountability 


Power-Wielder Mechanisms Strength of Constraints 
Multlateral organization Delegated and participatory: Strong 

Supervisory, fiscal, hlerarchical, 

reputational 
NGOs Mostly participatory’ Peer, Strong when NGO Is weak; 


reputational, market 
Mostly participatory: Peer 


negotlation constraints 
Firms Mostly participatory: Market, Stronger for brand-name firms 
reputational in consumer markets and medla 


Extemal accountability mostly 
participatory: Peer, reputational 
Delegated accountability 


(supervisory and fiscal) only for negotation constralnts are more 
weak, dependent states who have Important than accountability 
accepted supervision as a mechanisms. 


condition for support 
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weaker when NGO ls strong 
Quite weak, more a matter of 


firms than for more anonymous 
firms 

Varles with state power and 
degree of interdependence. For 
relatively powerful states, 


unimportant. What we do emphasize, however, is that 
domestic democracy is insufficient. Even democratic 
states will act in a biased way toward noncitizens, The 
mechanisms of accountability that can work so well to 
make governments responsive to their own citizens also 
can work against the interests of noncitizens affected 
by government policies. 

Nothing in our discussion is meant to imply that 
accountability mechanisms in world politics work op- 
timally, or even very well. We have emphasized that 
democratic standards for accountability are not met 
at the global level. Accountability in world politics is 
inextricably entangled with power relationships. Each 
of the seven mechanisms of accountability that we have 
identified empowers different actors. Those who would 
hold power-wielders to account need power them- 
selves. Weak actors—including small, poor countries 
in the Global South and, more, their often disenfran- 
chised publics—lack the capacity systematically to hold 
powerful actors accountable. Furthermore, account- 
ability is relatively haphazard. Nothing guarantees that 
the issues brought to public view are the most impor- 
tant actual or potential abuses of power in the world ata 
given time. There is no working torts system that would 
promote consistent legal accountability, or any sort 
of world “ombudsman.” Ironically, the accountability 
mechanisms that do exist may sometimes have per- 
verse effects. Fear of adverse publicity can discourage 
bold action that could save lives and otherwise promote 
human values. Having been lambasted for building big 
dams, the World Bank is likely to be very reluctant to 
build any big dams again, even if some such projects 
would be beneficial for development. 

In world politics, accountability for most power- 
wielders is likely to be less constraining than is 
optimal. Power-wielders certainly cannot be ex- 
pected to hold themselves accountable—they resist 
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accountability because it restricts their autonomy. But 
the answer is not simply “more accountability.” In- 
stead, we need more intelligently designed accountabil- 
ity systems that are sensitive to the variety of possible 
mechanisms of constraint, the shortcomings of existing 
accountability mechanisms, and the normative claims 
of those adversely affected by global entities. 


CONCLUSIONS 


We do not expect that the debate over accountabil- 
ity in world politics will end any time soon. In fact, 
we have argued that this debate is fueled by a fun- 
damental tension between claims derived from dele- 
gation models and claims derived from participation 
models of accountability. Attending to the ways in 
which these claims can and should be accommodated 
and balanced differently with respect to various global 
power-wielders can open new prospects for improving 
accountability to constrain abuses of power. 

Recognizing new possibilities for accountability re- 
quires abandoning the belief that global accountabil- 
ity, to be genuine, must conform to abstract, maximal 
principles of democratic participation. Such a belief 
prevents us from recognizing specific opportunities for 
limiting abuses of power. We suggest instead that ac- 
countability is only one way of constraining power; that 
there are many forms of accountability that are not 
particularly unique to democracy; and that there are 
various ways of conceiving of democratic accountabil- 
ity, including delegation models as well as participation 
models. Participation is an important value. Indeed, 
the activist participation of individuals and NGOs in 
debates on global governance has put accountability 
on the global agenda, and efforts should be made to 
facilitate greater participation in global governance, 
particularly through transparency. But with respect to 
those entities to which power was delegated, establish- 
ing minimal standards for the accountability of power- 
wielders to those that delegated power in the first place 
can achieve positive results by limiting corruption, 
fraud, and abuse of power, even if broad participation 
is not achieved. 

We should be seeking to create processes for check- 
ing abuses of power with the full recognition that ev- 
ery type of power is subject to abuse. Improved ac- 
countability mechanisms should be explored, but other 
means of constraining power ought to be considered 
as well. And finally, we must recognize that there is 
no single “problem of global accountability”; there are 
many. The point is not to design a comprehensive, ideal 
accountability system but, rather, to figure out how to 
limit abuses of power in a world with a wide variety of 
power-wielders and without a centralized government. 

If we focus on the conditions for the operation of a 
variety of accountability mechanisms, rather than on 
pure democratic accountability, we will see opportu- 
nities for feasible actions to improve accountability. 
Each of the components of accountability at the global 
level exists to some extent: standards, sanctions, and 
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information. Each could be improved, in combination 
with progress on the others. 

The example of agricultural subsidies and the WTO 
illustrates the importance of standards of behavior. 
There is an emerging consensus that rich countries’ 
agricultural subsidies, which are seven times as great 
in magnitude as all official development aid (Wolf 
2004), and which severely harm very poor countries, 
cannot be justified in terms of either efficiency or eq- 
uity. Governments of poor countries have been at- 
tacking such subsidies in the Doha Round, and their 
rich-country counterparts have had difficulty defend- 
ing them (partly because they have been active for 
years in criticizing agricultural protectionism by other 
rich countries). As a result, the WTO—whose policies 
reflect prevailing norms insofar as they are accepted 
by powerful governments—has been moving, haltingly, 
toward restricting agricultural subsidies. Here is an ex- 
ample of the kind of accountability process that we 
have been describing. This accountability process is not 
particularly democratic, nor is it pure: It works partly 
because of negotiation constraints arising from the re- 
fusal of developing countries to make trade agreements 
without concessions on agricultural subsidies. But peer 
and reputational forms of accountability are certainly 
at work here. 

The second condition for accountability is the avail- 
ability of sanctions. Any effective sanctions in a fea- 
sible global accountability system will have to be de- 
centralized, since there is no central government to 
impose them. But decentralized sanctions are quite fea- 
sible where incentives are strong to impose them, and 
where the capacity to do so is distributed fairly widely. 
The history of the international trade regime demon- 
strates that among relative equals in power, decentral- 
ized sanctions can work (Barfield 2001). Indeed, many 
people worry that the ability of the WTO to impose 
sanctions has recently become too strong, insofar as 
WTO rules override local democratic control. How- 
ever, sanctions remain the weak point in global ac- 
countability since they can only be implemented by the 
powerful—tfor example, by powerful states over mul- 
tilateral organizations to whom they have delegated 
power. As we have emphasized, the principal means 
in domestic democracies of participatory sanctions— 
elections—is not available on a global basis. 

The final component of accountability, information, 
may be the easiest to achieve. Crucial to the efficacy of 
an information system for controlling abuses of power 
is that control over it not be limited to power-wielders 
and the entities that originally authorized their actions. 
On the contrary, the system should be open to new 
groups, seeking to provide information relevant to the 
question of whether power-wielders are meeting ap- 
propriate standards of behavior—and to make that 
information widely available. The decentralization and 
diversity of world politics make transparency easier to 
achieve than in previous generations, since the means 
of modern communication, and their diffusion across 
societies, give a wide variety of voices the capacity to 
express themselves (Florini 2003). Furthermore, the 
costs of providing information through web sites are 
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now so low that it is difficult to use cost or inconve- 
nience as an excuse; people around the world are in- 
creasingly used to being able to get the information that 
they want almost instantaneously. The transparency 
of major multilateral institutions, including the IMF, 
World Bank, and WTO, has markedly increased over 
the past decade. Markets are increasingly demanding 
transparency from finance ministries of states, and from 
firms. NGOs must also be increasingly transparent if 
they are to remain credible. If an organization does 
not reveal information itself, others are likely to spread 
information, in more critical forms, about it. The ca- 
cophony and disagreement endemic to world politics 
mean that it may be more feasible to establish an effec- 
tive information system for accountability in this do- 
main than to establish it in many national states—either 
in those with repressive governments or in those with 
uniformity of opinion on a set of important issues. 

A prerequisite to improving accountability in world 
politics is to think about it clearly. There are two key 
questions about abuses of power and accountability: 
(1) What constitutes an abuse and why? and (2) Who is 
entitled to hold power-wielders accountable and why? 
The analysis of two competing models of accountability 
makes clear that these questions do not have a single 
answer. Power wielders can be called to account for 
failing to fulfill their official duties or for failing to serve 
the interests of those affected by their actions. And they 
can be called to account by those who authorized them 
as well as by those affected by them. Both delegation 
and participation models are important: An effective 
accountability system should combine elements from 
both. But strict analogies from domestic democratic 
politics should be regarded with skepticism, and we 
should resist the temptation to narrow the issue of 
accountability to that of democratic control. Ingenu- 
ity in devising effective mechanisms, and the ability to 
synchronize their operation, will be more important 
for controlling abuses of power than a single-minded 
and mechanical application of the ideals of democracy. 
Our aspiration should be to create and support various 
kind of accountability systems in world politics based 
on an appreciation of the competing and complementry 
possibilities within democratic theory. 
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Does Collective Identity Presuppose an Other? On the Alleged 


Incoherence of Global Solidarity 
ARASH ABIZADEH McGill University 


g to which seeing myself as a self requires recognition by an other 


o arguments apparently support the thesis that collective identity presupposes an Other: the 
recognition argument, accordin 


whom I also recognize as a self (Hegel); and the dialogic argument, according to which my sense 
of self can only develop dialogically (Taylor). But applying these arguments to collective identity involves 
a compositional fallacy. Two modern ideologies mask the particularist thesis’s falsehood. The ideology 
of indivisible state sovereignty makes sovereignty as such appear particularistic by fusing “internal” 
with “external” sovereignty; nationalism imagines national identity as particularistic by linking it to 
sovereignty. But the concatenation of internal sovereignty, external sovereignty, and nation is contingent. 
Schmitt’s thesis that “the political” presupposes an other conflates internal and external sovereignty, while 
Mouffe’s neo-Schmittianism conflates difference (Derrida) with alterity. A shared global identity may 
face many obstacles, but metaphysical impossibility and conceptual confusion are not among them. 


6 6 ur common humanity will never make us 
O members of a single universal tribe. The 
crucial commonality of the human race 
is particularism” (Walzer 1992, 171). The particularist 
thesis expressed here by Michael Walzer, about the 
inherent particularity of collective identity, solidarity, 
or community, has become so widespread, common- 
place, even self-evident, that it seems only to require 
being stated to win assent. It is a thesis with devastat- 
ing implications for the possibility of a democratically 
legitimated cosmopolitan political order buttressed by 
a sense of identification or solidarity with humanity 
as a whole! As Dominique Schnapper (1994, 183) 
puts it: “Collective attachments are always affirmed 
in opposition to others. How could the sentiment of 
belonging to humanity be opposed to an Other? What 
subjective meaning could the idea of belonging to hu- 
manity have?” For Chantal Mouffe (2000, 13, 41), the 
premise that “collective identities can only be estab- 
lished on the mode of an us/them” all but guarantees 
the anti-cosmopolitan conclusion that “there can never 
be a democracy of mankind.” And again, according to 
David Miller (1989a), the view that community might 
embrace all of humankind “neglects the fact... that 
communities just are particularistic. In seeing myself as 
a member of a community, I see myself as participat- 
ing in a particular way of life marked off from other 
communities by its distinctive characteristics” (67-68, 
emphasis mine). The particularist thesis apparently en- 
joys the status of a conceptual or metaphysical truth. 
It is not just political theory, but also international 
relations, comparative politics, sociology, and anthro- 
pology, that regularly operate under the shadow of the 
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particularist thesis.* What I want to demonstrate here 
is that the particularist thesis, construed as a conceptual 
or metaphysical claim about the inherent nature of col- 
lective identity, solidarity, or community, does not stand 
to reason. It is one thing to say that identity presup- 
poses difference, it is quite another to say that it presup- 
poses an external other. Though so widely believed that 
it is rarely explicitly stated, much less defended, I shall 
demonstrate that the central arguments often thought 
to support the particularist thesis are fallacious. This 
is of capital importance because (even tacit) concep- 
tual or metaphysical claims about collective identity 
set feasibility limits on both normative and empirical 
theories of politics—in this case, on normative theories 
of democracy and empirical theories of international 
relations and intergroup conflict. 

The precise nature and significance of these feasibil- 
ity limits depend on whether the particularist thesis is 
interpreted in a weaker or stronger way. The weaker 
way claims that a collectivity inherently requires the ex- 
istence of some external other in contrast to which it can 
define itself On this interpretation, some individuals 
must necessarily be excluded in order for a collectivity 
to exist. But this exclusion claim says nothing about 
the nature of the other-relation: that relation may be 
(some combination) of love, admiration, competition, 
resentment, hostility, etc. The second, stronger version 
of the particularist thesis goes further and specifies the 
nature of the relation to the other that the constitu- 
tion of a collectivity supposedly requires: a relation 
of either antagonism or hostility. On this interpreta- 
tion, a collectivity inherently requires adversarial ex- 
clusion: the existence of some external other against 
which it can define itself. This Schmittian version 
thus combines the exclusion claim with an adversary 


My initial focus is on the exclusion claim common 
to both versions. This component of the particular- 
ist thesis already has an important consequence for 
democratic theory: as the authors cited above suggest, 


2 For example, Brown 2001, 131, Huntington 1996, 20-21, 67, 
Schnapper 1994, 183, 106, and Eriksen 1995, 427, respectively. 
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the particularist thesis raises an insurmountable barrier 
for cosmopolitan theories of democracy. Even theorists 
who do not explicitly state and defend the particularist 
thesis as a conceptual or metaphysical truth can be 
shown to subscribe to it tacitly by virtue of the pu- 
tatively insurmountable anticosmopolitan normative 
conclusions they draw from it. 

A second consequence is supplied by the stronger, 
adversarial version of the particularist thesis. The truth 
of the particularist thesis, on this interpretation, implies 
that intergroup relations are inherently characterized 
by friend/enemy conflict or antagonism. This is the 
point of departure for some of the most influential 
analyses of politics today—from so-called realist in- 
ternational relations theory’ to Samuel Huntington’s 
“clash of civilizations” thesis.* The upshot for norma- 
tive democratic theory is, in turn, that democracy in- 
herently depends on the identification of an enemy 
or adversary: if the particularist thesis rules out cos- 
mopolitan democracy, its stronger version implies that 
global politics will always be characterized by clashes 
between enemies. 

My argument against the first anticosmopolitan con- 
sequence is that the exclusion claim, far from reflecting 
a conceptual or metaphysical truth—one that raises 
an insurmountable barrier to overcoming the current 
Westphalian interstate political order—derives much 
of its apparent plausibility precisely from two features 
of the current Westphalian interstate order itself. For 
under the current interstate system, collective identi- 
ties often do constitute themselves in contrast to an 
external other. But this is a contingent empirical fact: 
to reify it is to bar the possibility of overcoming the cur- 
rent anticosmopolitan order on the mistaken grounds 
that it is conceptually or metaphysically impossible to 
do so. I suggest that this reification is in part achieved by 
one of the Westphalian order’s most powerful ideolo- 
gies: the ideology of indivisible sovereignty. The par- 
ticularist thesis, in other words, masks a reification of 
contingent processes of collective identity formation 
that, within the Westphalian order, it is tempting to 
redescribe ideologically as natural features of the uni- 
verse. But it is the current anticosmopolitan order itself 
that ideologically imagines itself to be the necessary 
consequence of the nature of things, in part via the 
reification involved in the particularist thesis. 

I similarly argue that the second, stronger view— 
that intergroup relations are inherently conflictual— 


3 See, for example, Gilpin’s (1986, 304-5) outline of the basic 
pees of the “realist” tradition. 

Huntngton approvingly cites a fictional character to the effect 
that “there can be no tme fnends without true enemies” and that 
“unless we hate what we are not, we cannot love what we are.” 
In his own words Huntington (1996) says that “for peoples seeking 
identity . enemies are essential, and the potentially most dangerous 
enmuities occur across the fault lines of the world’s major civilizahons” 
(20; cf. 21, 67) It is true that there 1s some evidence that Huntmgton 
does not think he 18 making an ahistorical claim about the nature 
of identity as such But this historical sensitivity does not lead 
Huntington to revise his view that intergroup relations are inherently 
characterrzed by a friend/enemy dynamic, what is a contingent his- 
torical fact is sumply that today’s most politically salient fnend/enemy 
“fault tines” are avdizaronal (20-29, 43). 
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mischaracterizes politics by reifying the causes of con- 
flict and antagonism. These causes are not to be found 
in the inherent nature of collective identity-formation 
as such but, rather, in the contingent circumstances in 
which particular identities are formed, circumstances 
that it is the task of social scientists to investigate em- 
pirically. Theorists such as Mouffe, who ground their 
defence of agonistic democracy in collectivities’ puta- 
tive need for an other, are conflating the Schmittian 
language of alterity with the Derridean language of 
difference. 

To expose these reifications is to open up the possi- 
bility of conceiving collective identity and, indeed, of 
(popular) sovereignty in cosmopolitan terms. This pos- 
sibility runs counter to even some currents of construc- 
tivism in recent international relations theory. While 
“realists” have traditionally taken the exclusionary and 
adversarial nature of the state for granted, construc- 
trvists, because concerned to explain how states come 
to acquire their identities, have been open to the pos- 
sibility that state identities need not be adversarial 
(Wendt 1992). But my analysis suggests that construc- 
tivists must be careful not to tacitly smuggle in assump- 
tions about the inherently particudaristic nature of state 
identity, assumptions that are all too tempting, thanks 
to the way in which state identity is shaped by the 
modern institution of sovereignty (cf. Jepperson et al. 
1996, 45-47, 59, note 85). Otherwise, constructivists 
risk reproducing anew the “realist” conclusions that 
had motivated their critique in the first place.* 


THE PARTICULARIST THESIS AND THE 
ANTICOSMOPOLITAN THESIS 


The particularist thesis can be stated in terms of collec- 
tive identity, solidarity, or community. It can even be 
stated in terms of cultures or collective modes of life, 
as when Anthony D. Smith (1990, 171) asserts that 


we can only speak of cultures, never just culture; for a 
collective mode of life, or a repertoire of beliefs, etc., pre- 
supposes different modes and repertoires in a universe of 
modes and repertoires. Hence, the idea of a ‘global culture’ 
1s a practical impossibility, except in interplanetary terms® 


Claims such as this, about the inherent nature of collec- 
tive modes of life, speak of a conceptual or metaphysi- 
cal truth not open to empirical falsification. 

Mouffe (2000, 213) advances the particularist thesis 
explicitly in terms of identity when she says that “col- 
lective identities can only be established on the mode of 
an us/them.” But solidarity and community are related 
to identity as well. In the ordinary sense of the word, 
community entails its members’ identification with the 
group that constitutes it and, thus, entails a collec- 
tive identity; in the stronger, moralized sense of the 
word, community also entails mutual concern among 
its members, i.e, a sense of solidarity, a disposition 
to “give each other’s interests some noninstrumenta! 


$ For a nuanced constructivist approach that avoids this problem, 


see Wendt 1999, 2003. 
6 Smith (1992, 75) makes a similar point vis-a-vis identity. 
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weight in their practical reasoning” (Mason 2000, 27).’ 
And though it is an empirical question, solidarity— 
at least the kind required for democratic politics—is 
often thought to depend on a shared collective identity 
(Cohen 1999, 263). Since it turns out to be the crucial 
version of the thesis, I focus here on the particularist 
thesis stated in terms of collective identity. 

Insofar as it is true that democratic politics requires, 
as its motivational basis, a sense of solidarity grounded 
in a shared collective identity,’ the particularist thesis 
implies that the cosmopolitan dream of a democrat- 
ically legitimated and just global political order cen- 
tered on a global human identity is in principle inco- 
herent, or impossible on metaphysical grounds. The 
anticosmopolitan argument can be reconstructed as 
follows. 


1. Collective identity (and the sense of solidarity cen- 
tered on it) is inherently particular (i.e., any collec- 
tive identity presupposes another collective identity 
that is its external other). 

2. Therefore, a global human identity (and a sense of 
solidarity centered on it) is impossible. 

3. Besides shared material interests, democratically le- 
gitimated politics requires solidarity centered on a 
shared collective identity capable of motivating such 
solidarity and joint political action. 

4. Therefore, a democratic cosmopolitan political or- 
der is impossible. 

Premise 1 is the particularist thesis (in its weak form) 
and Thesis 4 is the anticosmopolitan thesis routinely 
drawn as a consequence. Premise 3 articulates the em- 
pirical assumption that democracy requires a shared 
collective identity, or that it requires a sense of solidar- 
ity that in turn depends on shared identity. (The entire 
argument can also be restated in terms of “community” 
instead of “solidarity.”) I do not concern myself with 
Premise 3: for the sake of argument I simply take it 
for granted. My thesis is that, independent of the truth 
or falsity of Premise 3, the four-step anticosmopoli- 
tan argument fails: Premise 1 is false. While individual 
identity may indeed be inherently particular, collective 
identity is not, and the arguments for Premise 1 proceed 
only by equivocating between these two senses of iden- 
tity. There are undoubtedly many real-world obstacles 
facing the ancient cosmopolitan dream of global soli- 
darity and a community of humankind. But conceptual 
incoherence or metaphysical impossibility is not one of 
them. 


HEGEL AND TAYLOR: THE RECOGNITION 
AND DIALOGIC ARGUMENTS 


The notion that identity either conceptually or meta- 
physically requires an external other has a long history, 
a history that certainly includes Rousseau’s second 


7 See also Jaeggi (2001, 291), who argues that solidarity should be un- 
derstood as a symmetrical “form of non-instrumental cooperation.” 
8 This is another widespread assumption. See, for example, Cohen 
1999 and C Taylor 1998 
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Discours and Emile. But perhaps the most influential 
formulation of this idea in the European tradition is 
Hegel’s. I think it best to construe Hegel’s argument 
as metaphysical rather than conceptual, so I want to 
begin by first getting the conceptual argument out of 
the way. If Mouffe’s (1993, 84) claim that “every defi- 
nition of a ‘we’ implies the delimitation of a ‘frontier’ 
and the designation of a ‘them” is to be understood 
conceptually, and if the terms “we” and “them” refer 
to human beings, then the problem with Mouffe’s argu- 
ment is that its premise is false.” We can see the falsity 
of the premise by simply providing, pace the conceptual 
argument, a definition of a human “we” that does not 
designate a human “them.” Definition: Let x refer to 
all human beings. There is nothing incoherent about 
this. Now it might be objected that the definition of 
“we” in terms of humanity begs the question because 
it includes a reference to “humans,” which is precisely 
what is in question. This would be to miss the point. 
The rather simple point is that whatever the definition 
of “human being,” there is nothing incoherent about 
defining “we” such that it includes all the individuals 
picked out by the definition. Alternatively, it might be 
objected that the truth of particularism is demonstrated 
by the fact that even a universal human “we” presup- 
poses a nonhuman “them.” And indeed it does, but this 
has no bearing on the specific particularist thesis that 
human collective identities presuppose other human 
collective identities. In fact, the conceptual argument 
cannot even motivate drawing the line at human beings: 
that a universal human “we” presupposes a nonhuman 
“them” is a function of the qualifier human in “univer- 
sal human we,” not a function of we. There is nothing 
conceptually incoherent about an identification that 
goes beyond humanity, to include, for example, all sen- 
tient beings. Indeed—Eurocentric humanist intuitions 
notwithstanding—there is nothing conceptually inco- 
herent about an identification with the entire existing 
universe, such as that espoused by some pantheists. 
Much more interesting for particularists is Hegel’s 
(1977) argument. The premise of his argument is that 
selfhood involves the desire for self-certainty, that the 
self has a fundamental desire for self-consciousness 
and self-worth. The self first attempts to achieve self- 
certainty through the appropriation of external phys- 
ical objects in the world, by asserting dominion over 
them and integrating them into its plans and projects. 
But since inanimate objects can only reflect back to the 
self an image that the self already has of itself, and not 
the self-certainty it seeks, the satisfaction taken in exter- 
nal objects simply leads to new desires for new objects. 


? Though the words “definition” and “implies” here could suggest a 
conceptual reading, Mouffe’s statement can alternatively be read as 
a metaphysical claim, and m other contexts, which I treat in the final 
section, she does indeed advance the ist thesis ın meta- 
physical terms That Mouffe (2000) does apply “we” and “them” to 
human beings is made obvious by the anticosmopolitan thesis she de- 
rives from this premise (e.g., chap 2), by her references to “persons” 
(e g , 13) and the use of the second-person animate pronoun to refer 
to the other (such as “his ideas” and “his nght” [Mouffe 1993, 4P), 
and by her appeals to Carl Schmitt. I provide more detailed exegesis 
of Mouffe’s neo-Schmittianism m the paper's final section 
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This ultimately renders the desiring self-consciousness 
dependent on new objects, unable to establish its own 
independence. So self-certainty through the appropri- 
ation of nonhuman objects is inadequate for a human 
being’s sense of self-worth: self-consciousness needs an 
object that does not cease to be an object for it, an ob- 
ject capable of reflecting back to the self its own concep- 
tion of itself as free. Self-consciousness requires recog- 
nition by an other who is also a self-consctousness— 
another human being. And this recognition must be a 
mutual recognition, because if the self dominates the 
other, then the other’s recognition will not give the self 
what it requires, since by dominating the other the self 
denies to the other the freedom it must have if its 
recognition is to be worth anything (cf, Wood 1990, 
86-92). 

Taking a cue from Rousseau and Hegel, Charles 
Taylor (1994, 32-33) argues that the “close connection 
between identity and recognition” should be under- 
stood in terms of the “fundamentally dialogical char- 
acter” of human life: 


We become full human agents... through our acquisition 
of rich human languages of expression. . [learnt] through 
exchanges with others. ... The genesis of the human mind 
is in this sense not monological...but dialogical.. . We 
define our identity always in dialogue with, sometimes in 


struggle against, the things our significant others want to 
1 


see 1n us 


Taylor’s point is sound. But anticosmopolitan par- 
ticularists are mistaken to think that either Hegel’s or 
Taylor’s argument provides them any support: for the 
particularist thesis in the anticosmopolitan argument 
outlined above tacitly assumes that what is true of 
our individual identities is also true of our collective 
identities. This assumption is false. The Hegel—Taylor 
argument shows why (1) an individual is able to attain 
self-consciousness only in relation to an external other, 
dialogically. This is because either (1a) to see myself as 
a self I need to be recognized by an external other I 
also recognize as a self (Hegel), or (1b) my sense of 
self is only developed and clarified dialogically with 
external others (Taylor), or both ({1a] and [1b]). But 
it does not follow (from either [1a] or [1b]) that (2) 
the individual can only identify with a collectivity that 
excludes some set of individuals. True, if (a) we accept 
the Hegelian account, the recognition necessary to an 
individual’s sense of self-hood must be external, be- 
cause none of the constituent parts of the individual 
self would possess the independent agency needed for 
a normatively relevant act of recognition. If we grant 
Hegel’s view of self-consciousness, then the selfs own 
internal recognition of itself as a self is dependent on 
external recognition.!! But the recognition required by 
a collecttve identity can come solely from the (other) 


10 See also Habermas's (1987) discussion of Mead 

11 My point is not to endorse Hegel’s view of self-consciousness, but 
to show that even if we grant it, the ensuing recognition argument 
does not apply to collective identity Poststructuralists might reject 
the Hegelian view out of hand, arguing that Hegel’s argument that 
individual self-consciousness requires external recognition portrays 
the subject as a unified self-transparent whole, a reified portrayal that 
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individuals who make up that collectivity—an option 
that would not analogously be available for the devel- 
opment of individual identity itself In other words, a 
collective identity can receive either the external recog- 
nition of an external other or the internal recognition 
of its own constituent parts, and the latter does not nec- 
essarily depend on the former.” Similarly, if we reject 
the Hegelian view of self-consciousness but (b) accept 
Taylor’s less metaphysically demanding view that iden- 
tity is formed dialogically, then in the case of an individ- 
ual, dialogue implies an external other with whom dia- 
logue must have been at some point conducted; but in 
the case of a collectivity, the dialogue might simply take 
place among its own constituent members. Taylor’s 
argument appeals to the fact that identity formation 
occurs through a linguistically mediated process of so- 
cialization. For an individual, socialization requires in- 
teraction with external others. But socializing an in- 
dividual to identify with a collective identity could, 
rather obviously, simply occur through interaction with 
individuals who also identify with it.“ The point is not 
that the constituent members who engage in mutual 
recognition or dialogue must be paradoxically specifi- 
able prior to the constitution of the collective identity; 
the point is simply that the constitution of the collective 
identity via recognition or dialogue is not dependent on 
the existence of excluded individuals. Neither Hegel’s 
nor Taylor’s arguments necessarily commit them to the 
particularist thesis. ` 

In other words, the arguments in defense of the par- 
ticularist thesis (that only collectivities with an external 
other can be the basis for identity) suffer from a fallacy 
of composition. Whether collectivities with an external 
other are better able to win identification is an empiri- 
cal question. There are no conceptual or metaphysical 
grounds for thinking that communities “just are partic- 
ularistic,” as Miller asserts, or that “collective attach- 
ments are always affirmed in opposition to others,” as 
Schnapper assumes. 

As such, we should also reject Miller’s (1989b, 239) 
assertion that loyalty to the human race is meaningless, 
because loyalty implies partialism: 


loyalty... means favouring the interests of members of the 
group at the expense of outsiders in certain circumstances. 
That ıs what loyalty means: talk of impersonal loyalty, or 
loyalty to the human race as a whole, is meaningless.’ 


Again, the argument suffers from not being attentive to 
the distinction between individual and collective iden- 
tity. It fails to distinguish between an “other” external 


ignores the ostensibly divided, opaque, fluid, and composite nature 
of agency 

12 For a mmular argument, see Wendt 2003, 527. In Hegelian terms, we 
might say that a collectrvity could have an “immediate” (unrnuttelbar) 
existence; its ultumate mediacy may be in relation to its own parts 
Cf C Taylor 1975, 105. 

13 This is why it is perfectly reasonable to agree with the second and 
third, but not the first, of William Connolly’s (1991) claims when he 
says that “every identity is particular, constructed, and relational” 


14 Jaeggi (2001, 290-91) makes a similar claim (“To call for ‘inter- 
national loyalty’ would therefore be absurd”) and seems to suggest 
that the same point might apply to “solidarity” as well 


American Political Science Review 


Vol. 99, No. 1 
mmm OZO 


to the collectivity to which I belong and other individual 
members of the collectivity to which I belong. True, the 
latter are not “other” insofar as they belong to the same 
collectivity to which I do, but they are other insofar 
as they are not me. Loyalty to the human race could 
very simply mean sacrificing my own narrow personal 
interests in favor of the interests of the human race as 
a whole (as when, for example, I make enormous per- 
sonal sacrifices to ensure that the Earth’s air remains 
pleasant for everyone to breathe). Furthermore, if hu- 
manity is in turn constituted by smaller collectivities, 
and not just individuals, then if I professed loyalty to 
the human race, I might very sensibly mean that when 
the interests of humanity as a whole clash with the more 
parochial interests of some part of humanity to which 
I belong, I would favor the interests of humanity. To 
call this nonsensical by definitional fiat is just that— 
definitional fiat. 


THE PECULIAR PARTICULARITY 
OF NATIONAL SOVEREIGNTY 


None of this is to suggest that there are no forms of col- 
lective identity that necessarily presuppose an external 
other. There very well may be. Indeed, I would like 
to suggest that the particularist thesis that collective 
identity, solidarity, or community necessarily presup- 
poses an external other has seemed so self-evidently 
true to so many, despite being based on a patent fallacy, 
because our intuitions on this matter have been misled 
by the modern conceptions of sovereignty and nation.'5 

F. H. Hinsley (1986) has argued that the modem 
notion of sovereignty developed in tandem with the 
modern state within the context of two problématiques: 
the nature of legitimate authority within a political 
community and the regulation of relations between sep- 
arate political communities. First, the modern notion 
of sovereignty came to involve the idea that there is a 
final and absolute political authority within the political 
community (26). This is what has come to be called 
“internal sovereignty,” which Hedley Bull (1977, 8) de- 
fines as “supremacy over all other authorities” within 
a particular territory and over a particular population 
(cf Krasner 1988, 86). But second, the modern notion 
of sovereignty asserts that no other authority exists 
outside of the political community (Hinsley 1986, 26). 
This furnishes the state’s “external sovereignty,” “by 
which is meant not supremacy but independence” from 
“outside authorities” (Bull 1977, 8). As such, via the 
application of reciprocity, the notion of sovereignty 
was deployed in practice to regulate interstate relations 
through the mutual recognition of each other’s inter- 
nal sovereignty over its own territory and population 
(Hinsley 1986, 158). 

As constructivists have noted, this reciprocal recog- 
nition has become a constitutive feature of the modern 
practice and institution of state sovereignty (Ruggie 
1993, 162). As Alexander Wendt (1992) once argued, 


15 Walker (1993, chap. 8) emphasizes the crucial role of the institu- 
tion of sovereignty in the constitution of modern identities 


Sovereignty is an institution, and so it exists only in 
virtue of certain intersubjective understandings and ex- 
pectations; there is no sovereignty without an other.... 
[Sovereignty involves] a mutual recognition of one an- 
other’s right to exercise exclusive political authority within 
territorial limits. (412, my emphasis)’6 


If this is right, then at least state sovereignty would fol- 
low a particularist logic of external mutual recognition. 
(Indeed, this is Hegel’s view).!” The implication would 
be that collective identities centered on the modern 
sovereign state presuppose an external other. 

And there is a sense in which this is right. The mean- 
ing that sovereignty has acquired for us is, in part, 
constituted by its role in the Westphalian era of reg- 
ulating interstate relations. This explains why Anthony 
Giddens (1987, 281) puts the matter in conceptual 
terms: sovereignty, he claims, “only has meaning in the 
context of a reflexively regulated system larger than 
any one state.” So if sovereignty has come to mean 
two things conjoined, that is, internal sovereignty and 
the reciprocal recognition of this internal sovereignty 
among states, then sovereignty has come to be de- 
fined in a way that conceptually presupposes more 
than one sovereign power. But the combination of 
“Internal sovereignty” with “external sovereignty” is 
itself a contingent, historical, not conceptually nec- 
essary, development.’* The modern ideology of state 
sovereignty is so pervasive, of course, that the con- 
trary assumption, that internal sovereignty somehow 
entails external sovereignty, is almost as widely held as 
the particularist thesis itself (e.g., Philpott 1995, 357). 
But to say that there is, in the political community, 
a final political authority requires conceptually that 
there not be others outside of the political community 
who can override that authority; it does not require 
conceptually that there be others outside of the po- 
litical community who cannot override that authority. 
Sovereignty has come to mean, for us, both internal 
and external sovereignty, precisely because it has been 
deployed in the Westphalian era as a principle of legit- 
imation regulating interstate relations)? But a “world 
sovereign,” sovereign in the sense of being an ultimate 
political authority, is not conceptually incoherent; it 
would simply strip sovereignty of the additional mean- 
ing it has for us as an institutionalized way of regulating 
interstate relations. Indeed, Claude Lefort (2000, 33) 
argues that in France the innovation of the Middle 
Ages was precisely the delimitation of sovereignty: in- 
scribing sovereignty on a limited, bounded territory in 
formulae such as “empereur dans son royaume” was 
initially deeply paradoxical, for such formulae combine 
“the idea of imperium [previously] reserved for the 


16 Wendt (1999, chap. 5; 2003) qualifies this view significantly m later 
work. 

17 See Hegel 1991, paragraphs 321-324, 331, plus remarks, and 
Brown’s (2001, 129) reconstruction of the argument 

18 Spruyt (1994) argues that the sovereign ternitorial state, with a 
hierarchical internal authority structure and a demarcated terntonal 
Junsdiction, prevailed over other modes of political organization 
because of its capacity to prevent free-nding domestically and to 
make credible commitments externally 

19 Never perfectly, of course. On this, see Krasner 1995 
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Sovereign of the world with that of a power exercised 
within the limits of a country.” 

As Stephen Krasner (1999, 3-4) points out, the dif- 
ferent senses of sovereignty both are conceptually in- 
dependent and have been routinely disaggregated in 
practice. Indeed, the two components can be disaggre- 
gated further: for Krasner, there are four “meanings of 
sovereignty.” (1) The claim to legitimate authority and 
effective control within a territorially circumscribed 
polity (what Krasner calls “domestic sovereignty”) 
is distinct from (2) the capacity to regulate move- 
ment across the circumscribed borders (“interdepen- 
dence sovereignty”). Even (3) “the exclusion of ex- 
ternal actors from authority structures within a given 
territory,” which he calls “Westphalian sovereignty,” 
is distinct from (4) what he calls “international le- 
gal sovereignty,” or “practices associated with mutual 
recognition” between territorial entities with formal 
judicial independence. Krasner demonstrates not only 
that, in practice, states have possessed sovereignty in 
each of these senses without the others, but that in some 
cases the possession of one kind of sovereignty can ac- 
tually undermine another. It is only sovereignty in the 
sense of international legal sovereignty that concep- 
tually presupposes recognition by another sovereign. 
Sovereignty in the sense of border control requires 
an other, but not necessarily another sovereign—just 
other territory. And Krasner’s “domestic sovereignty” 
and “Westphalian sovereignty” do not conceptually 
presuppose the existence of an external other at all. 

Despite the fact that, in practice, states have pos- 
sessed the various kinds of sovereignty in different 
combinations and degrees, the modern ideology of 
sovereignty nevertheless combines all four senses 
and construes them as mutually entailing.” Given 
this (historically contingent) combination—given what 
sovereignty has come to mean ideologically—any col- 
lective identity that is hnked to this modern notion is no 
doubt prone to reproduce its requirement of external 
recognition. This is not to say, of course, that alternative 
ideologies of (say, divided or dispersed) sovereignty 
have been completely crushed—far from it. The point 
is, rather, that the ideological combination of these dis- 
tinct attributes has powerfully shaped the terms of the 
debate. 

So it is rather unsurprising that national identity has 
come to be conceived in this way as well. After all, 
a typical feature of the modern phenomenon of the 
nation is the political claim to sovereignty over some 
territorially defined space.” Benedict Anderson (1991, 
6) defines the nation as “an imagined political commu- 
nity” that is “imagined as both inherently limited and 
sovereign.” Since the modern ideology of sovereignty 
construes the sovereign as necessarily one sovereign 
among others, to say that the nation is imagined as 


2 This modern ideology has been powerfully shaped by the 
Hobbesian/Rousseauist theory of the indrvisibility of sovereignty, 
directed against the Grotian/Pufendorflan theory of the parts of 
sovereignty. See Derathé 1970, 280-94 

21 Lefort (2000, 31) remarks that without this link to sovereignty, “le 
terme ne désignerait qu’une ethnie” 
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sovereign is already to have said that it is imagined as 
limited. So it is no surprise that national identity as we 
know it today is, like the modern notion of sovereignty, 
necessarily particularist. Jonathan Rée (1998, 83) has 
even argued that identification with a nation presup- 
poses the existence of other nations: 


You cannot have a sense of belonging to the same na- 
tion as your neighbors unless you are aware of it as 
one nation among others and of an imagined totality of 
nations. ... Nations exists only in the plural, in other words, 
and if every nation but one were supposed destroyed, then 
the last one would cease to be a nation as well. 
sentiments acquire national significance only in the light 
of an imagined international order. 


Rée’s argument is that to transform local sentiments 
of attachment, which find their sources in the personal 
experiences of face-to-face interaction, into the neces- 
sarily imagined realm of the nation, one must imag- 
ine one’s nation as bounded and in contrast to other 
nations, because of the notion’s embeddedness in an 
imagined international order. But contrary to what 
the particularist might suggest, this does not establish 
that individuals can come to identify with an imagined, 
non—face-to-face community only if that community 
is particular and bounded. Rather, the point must re- 
main specific to national identity: identification with 
the nation, as opposed to just any form of imagined 
community, presupposes the existence of other nations. 
What the particularist thesis falsely claims for collec- 
tive identity as such might be true of some collective 
identities. 

To sum up, then: even the apparently necessary 
particularism of national identity depends on the na- 
tion having become invested in the modern notion of 
sovereignty. With the rise of the idea that sovereignty 
inheres in the nation, if the nation just is, by defini- 
tion, a community that is imagined as sovereign, and 
if sovereignty refers to an institutionalized way of reg- 
ulating external relations with other sovereigns, then 
national identity is necessarily particularistic. Again, 
this concatenation of ideas is historically contingent. 
Liah Greenfeld (1992, 8-14, passim) has in fact argued 
that the conception of the nation as a particular com- 
munity, i.e., one whose uniqueness requires a contrast 
with other nations, is a late, eighteenth-century, devel- 
opment. If she is correct, then it is no surprise that this 
development coincides with the rise of the doctrine of 
national sovereignty. 


SCHMITT: THE POLITICAL ENEMY 
ARGUMENT 


Does any of this show that a collective identity cen- 
tered on “humanity” could in principle be the basis 
for a democratically legitimated cosmopolitan politi- 
cal order? Such a conclusion might still be denied by 
urging that, even if collective identities as such need 
not presuppose an other, specifically political collective 
identities do. This is the thrust of Mouffe’s (2000, 43- 
44) Schmittian claim that humanity “is not a political 
concept”; it is also what motivates her endorsement of 
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Schmitt’s conclusion that “political democracy cannot 
be based on the generality of all mankind” (Mouffe 
2000, 40). Indeed, the reach of the Schmittian objection 
is not restricted to specifically cosmopolitan projects: 
the objection, if valid, would show that my argument 
about collective identities has no bearing on the polit- 
ical sphere at all. 

What I want to demonstrate is that Carl Schmitt’s 
argument, which centers on an analysis of the nature 
of sovereignty, the modern state, and “the political” 
sphere, far from providing a wholly novel justification 
for the particularist thesis, instead provides a brilliant 
illustration of how the modern ideology of sovereignty 
helps to mask the fallacy, already noted, behind the 
particularist thesis: Schmitt’s “political” argument in 
defense of the particularist thesis, and the anticos- 
mopolitan conclusion he draws from it, mirrors the 
same fallacious structure of the recognition and dia- 
logic arguments for particularism—it rests on a system- 
atic conflation of the necessity of internal and external 
“others.” 

For Schmitt (1996, 19, 47), the modern sovereign 
state is “a specific entity of the people,” “an orga- 
nized political entity” that seeks to maintain internal 
peace, is “territorially enclosed,” and is “impenetra- 
ble to aliens.” At least for the early, pre-Nazi Schmitt 
(1985b), sovereignty inheres, within the context of a 
Rechtsstaat, in the one who “decides what constitutes 
an exception” in the application of general rules to 
particular circumstances (43).~ This presupposes, with 
Jean Bodin, a distinction between laws and exceptional 
decrees or measures and, thus, between ordinary times 
and exceptional or emergency situations.” As Schmitt 
puts it elsewhere, “Sovereign is he who decides on the 
emergency situation [or exception, or state of emer- 
gency (Ausnahmezustand)|” (1922, 8; 1985a, 5).74 The 
nature of political emergency arises, for Schmitt (1996), 
from the fact that the political sphere is inherently con- 
stituted by the distinction “between friend and enemy” 
(26). “The concept of the state,” he asserts, “presup- 
poses the concept of the political” (19) and the political 
is always haunted by the spectre of existential struggle: 


to the enemy concept belongs the ever present possibil- 
ity (die thm Bereich des Realen liegende Eventualitat) of 
combat. ... The friend, enemy, and combat concepts re- 
ceive their real meaning precisely because they refer to 
the real possibility (reale Moglichkext) of physical killing. 
(Schmitt 1987, 33; 1996, 32-33) 


As Gopal Balaknshnan (2000, 148, 192) points out, 
Schmitt later abandons the sharp distinction between 
laws and emergency measures and, by the time he be- 
comes a Nazi, completely jettisons the theoretical ap- 
paratus of the Rechtsstaat: the “total” sovereign legit- 


2 Cf Schmitt 1985b, 42: “The whole theory of the Rechtsstaat rests on 
the contrast between law which 1s general and already promulgated, 
universally binding without exception, and valid in principle for all 
tumes, and a personal order which varies case to case.” 

B On the early Schmitt’s discussion of and debt to Bodin, see 
Balakrishnan 2000, 34 

44 For discussion of the state of emergency, see Balakrishnan 2000, 
44-46. 
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imately rules by decree. What remains constant, how- 
ever, is Schmitt’s commitment to viewing the sovereign 
as the one who decides matters of existential strug- 
gle. Rechtsstaat or no, sovereignty is a question of 
who “decides the extreme case (Konfliktsfall) and 
determines the decisive friend-and-enemy grouping” 
(Schmitt 1987, 43; 1996, 43). 

Schmitt’s (1996) anticosmopolitan conclusion de- 
rives from an argument whose structure is similar to 
the anticosmopolitan argument I considered earlier, 
though it begins with a stronger premise: 


The political entity presupposes the real existence of the 
enemy and therefore an other, coexisting, political entity. 
As long as a state emsts, there will thus always be in the 
world more than just one state. A world state which em- 
braces the entire globe and all of humanity cannot exist. 
The political world is a pluriverse, not a universe. (53) 


Schmitt thus introduces two modifications to the an- 
ticosmopolitan argument. First, he glosses the partic- 
ularist thesis in its stronger, adversarial form. Second, 
he embeds the particularist thesis within a specifically 
“political” context. Schmitt’s political version of the 
anticosmopolitan argument can be reconstructed as 
follows. 


1. Any political entity presupposes an adversary. 

2. An entity’s adversary is other than the entity itself 
(i.e., it is an other to the entity). 

3. Therefore, any political entity presupposes an ad- 
versary who is its other. 

4. The sovereign state is a political entity. 

5. Therefore, the sovereign state presupposes an other 
(another sovereign state who is its adversary). 

6. Therefore, a sovereign global state is impossible. 


Premise 1 is a political version of the adversary claim 
and Premise 2 reflects the exclusion claim. Thesis 6 is 
Schmitt’s version of the anticosmopolitan thesis. Steps 
1-5 constitute Schmitt’s argument for his political and 
adversarial version of the particularist thesis (Thesis 5), 
construed in terms of sovereignty. I take Premise 1 
for granted for now in order to notice the (fallacious) 
structure of Schmitt’s argument for his version of the 
particularist thesis. My thesis is that by embedding the 
particularist thesis within an argument framed in terms 
of sovereignty, Schmitt helps himself to an equivo- 
cation between the internal and external aspects of 
sovereignty; in other words, the argument conflates 
internal and external “others.” 

More particularly, the derivation of Thesis 5 (that the 
state presupposes an other state) depends on an equiv- 
ocation between two senses of “political entity” and/or 
between two senses of “other.” “Political entity” can re- 
fer to the state as a whole, that is, the entire population, 
territory, institutions, etc., within its bounded space— 
call this the comprehensive sense of political entity. 
Or it can refer to a component part of the state, for 
example, the sovereign power within it—call this the 
partial sense. Similarly, an entity’s “other” can refer to 
another entity that shares no constituent parts with it— 
call this the external (or strict) sense of “other.” (This 
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is the sense in which I have used the word so far.) Or it 
might refer to another entity whose constituent parts 
are not fully coterminous with it, i.e., an entity that 
is not identical to it—call this the expansive sense of 
“other.” So defined, the external (or strict) sense of 
other is a species of the expansive sense. One example 
of a given political entity’s other in the expansive but 
not strict sense might be another political entity some 
and only some of whose citizens (or territory, institu- 
tions, etc.) are shared with it: for example, England 
and Scotland share some citizens and institutions (the 
British Parliament) but not others (the Scottish Par- 
liament). Call this an overlapping other. Another kind 
of other in the expansive, nonstrict sense is one that 
is simply a part of the first entity: call this the encom- 
passed sense of other. For example, Scotland is Great 
Britain’s encompassed other. The converse would be 
the encompassing sense of other. 

The plausibility of Thesis 2 (that an entity’s adversary 
is other than itself) depends on construing “other” in 
the expansive sense; the restrictive external sense of 
“other” would implausibly rule out the possibility that 
the adversarial relation is between an entity and one 
of its constituent parts. Canada’s “adversary” may be 
Russia (its external other) or, as arguably often is the 
case, it may be Quebec (its encompassed other). But 
once “other” in Thesis 2 is understood in this expansive 
sense, if Thesis 3 is to be derived from Theses 1 and 2, 
then it must also be understood as leaving open the 
possibility that the adversary is an internal adversary, 
1.e., a constituent part of the political entity, an “other” 
in the encompassed sense. Hence the only way to con- 
strue, in Thesis 3, “other” in the strict, external sense 
is to assert it independently of Theses 1 and 2. But 
even if it is so asserted, then my point can be made in 
a different way: if “other” is construed in the sense 
of external other, then Thesis 3 is plausible only if 
“political entity” is understood in the partial (and not 
comprehensive) sense—as the part of the state that is 
sovereign, for example. For the sovereign’s adversary, 
as Schmitt knew perfectly well, could be a domestic 
enemy.” 

The upsbot is that Thesis 3 (that any political entity 
presupposes an adversary who is its other) only makes 
sense on one of two assumptions: either “other” is to be 
understood in the expansive sense or “political entity” 
is to be understood in the partial sense (or both). The 
problem is that in neither case can Thesis 5 be logi- 
cally derived from the premises. For the derivation of 
Thesis 5 from Theses 3 and 4 either requires constru- 
ing “other” expansively, in which case its claim that 
“the sovereign state presupposes an other” can refer 
to an encompassed other—for example, a part of the 
state who is not sovereign—or “sovereign state” (in 
Thesis 4) must be equated with the partial sense of 
political entity, as not referring to the entire state, in 
which case its strict other may be a nonsovereign within 
the political entity. The outcome in both cases is the 


5 “As long as the state is a political entity this requirement for 
internal peace compels it ın critical situations to decide also upon 
the domestic enemy (innern Fand)” (Schmitt 1996, 46; 1987, 46) 
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same, and in neither case does it follow that, as Thesis 5 
claims, the sovereign state presupposes another 
sovereign state. A sovereign state simply presupposes 
something over which it is sovereign. So even if “the 
political” depends on the friend—enemy distinction, it 
does not follow that a sovereign state’s enemy must be 
other territorially organized states. That state violence 
is always a present possibility, in other words, does not 
imply the existence of “others” external to the jurisdic- 
tion of the legitimated use of violence. 

The fallacious structure of the Schmittian political 
enemy argument for the particularist thesis mimics, 
of course, the structure of appeals to the recognition 
or dialogic arguments: while the latter appeals fail to 
see that a collective identity can be “recognized” both 
by an external “other” in the strict sense and by its 
own constituent parts, the Schmittian argument fails 
to see that a sovereign state may find an adversary 
both in an “other” state (i.e., an external other) and 
domestically. The role that “sovereignty” plays here 
in masking this equivocation between the strict and 
expansive senses of “other” is rather clear: the point is 
that Schmitt’s argument in The Concept of the Political 
rests on a systematic conflation of internal and external 
sovereignty—which of course he believes go together. 

The upshot of my critique as it stands, however, is 
rather prim for the cosmopolitan thesis that Thesis 6 
opposes: I have refuted the derivation of Thesis 6 by 
arguing that the adversary—by which Schmitt means 
enemy—of the sovereign state may be an internal ad- 
versary. And a Schmittian might be willing to concede 
this: as Balakrishnan (2000, 110) points out, at the time 
of writing The Concept of the Political in the 1930s, 
there 


were two, equally compelling modern conceptions of the 
fundamental axis of political division: one which saw his- 
tory as the history of class struggles; and another which saw 
nation-states as the subjects of all world-historical conflict. 
Although Schmitt was closer to the latter conception, he 
never dismissed the plausibility of the former 


So in principle the enemy may be an external enemy 
(other nation-states) or it may be an internal enemy 
(the proletariat or the capitalists), and “the question 
of which of these two conceptions was the politically 
decisive one could be determined only historically” 
(Balakrishnan 2000, 110-11).”° But if indeed a political 
entity presupposes an adversary, then it appears that a 
cosmopolitan political order taking the form of a world 
state could be erected only on the backs of internal en- 
emies whom the state dominates and seeks to crush.’ 
A world state would be a world domination of one part 


26 On Schmitt’s equivocation over whether the enemy must be an 
external enemy or a domestic enemy, see Derrida 1994, esp 140- 
42. See also the related equtvocation that Dernmda notes between 
Schmuitt’s external/domestic distinction and public/private distinction 
(1994, 111). 

77 Cf Schmitt 1996, 51. “The solemn declaration of outlawmg war 
does not abolish the friend-enemy distinction, but, on the contrary, 
opens new possibilities by giving an international hosts declarahon 
new content and new vigor” 
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over another—a worldwide dictatorship of the prole- 
tariat, for instance. The world state would be—to raise 
the old bogeyman once again—a terrifying nightmare. 
And if not a world state, then a cosmopolitan order 
would be an imperialist order à la the British Empire, 
imposed by a world hegemon who dominates and lords 
over its “domestic” adversaries. A cosmopolitan order 
perhaps, but not a democratically legitimated one. 

The grim conclusion depends, of course, on tak- 
ing Premise 1 (that any political entity presupposes 
an adversary) for granted. And this premise is the 
adversary claim that forms the second component 
of the stronger version of the particularist thesis. So 
if we expand the term “entity” to encompass col- 
lective identities, and if we assume that the adver- 
Sary must be an external adversary, then Premise 1 
itself turns into a strong and politicized version of 
the particularist thesis: a collective political identity 
requires not just an external other, but an external 
adversary. The implication is that my critique of the 
particularist thesis is politically irrelevant. This is the 
point of saying that humanity is not a political concept. 
By contrast, if we assume that the adversary can be an 
internal adversary, then Premise 1 yields the conclusion 
that any appeal to “humanity” as a collective identity 
can work politically only insofar as it serves to legiti- 
mate and mask the domination of one part of humanity 
over others in the false name of humanity itself. (This is 
how Schmitt suggested “humanity” is in fact used.)*8 A 
“solidarity” centered on such an identity could hardly 
play the required role in democratic legitimation, for it 
would be an oppressive, hierarchical identity serving to 
crush some who are at one and the same time human- 
ity’s “members” and lorded over by their “fellows.” We 
are talking about an imperialist sort of cosmopolitan 
“solidarity” à la the white man’s burden. 

So Premise 1 seems to imply either (A) that there 
can be no politically relevant collective identity at the 
global level—a political version of the particularist 
thesis—or (B) that such an identity would be terribly 
oppressive. Neither option allows for a democratically 
legitimated cosmopolitan political order based on a 
genuine sense of human solidarity. 

The question is why we should grant Premise 1 in 
the first place. Of course if we define “the political” in 
particularist or adversarial terms as Schmitt does—if 
the political presupposes by definition a distinction bet- 
ween “friend and enemy”—then it is hardly surprising 
if “political” collective identities are necessarily partic- 
ularistic or oppressive. But to glean a substantive argu- 
ment requires that we abandon the tautological helps. 
Schmitt’s stated motivation for defining the political 
in these terms is to provide a definition independent 
of the state: first, in order to avoid the circularity of 
defining the political and the state in terms of each 
other and, second, in order to acknowledge the possibly 


B “When a state fights its political enemy in the name of humanity, it 
is not a war for the sake of humanity, but a war wherein a particular 
state socks to usurp a universal concept against its military opponent” 
(Schmitt 1996, 54.) Schmitt’s comment is specifically about interstate 
war, but his point about the function of “humanity” 1s general. 
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political dimension of society (Schmitt 1996, 20-22; cf. 
Balakrishnan 2000, 103). With Schmitt’s goal in mind— 
to determine the relation among the political realm, 
modern state, and society—the political and the state 
should indeed not be defined in terms of each other; 
but with our goal in mind—to determine the relation 
among the political realm and the particularist thesis 
and the adversary claim—then the political should not 
be defined in terms of the particularist thesis or adver- 
sary claim. It is, as G. E. Moore might have said, always 
an “open question” whether the political is particularist 
or adversarial, and not something that can be settled 
by definitional fiat. 

Schmitt’s substantive argument for the decisiveness 
of the friend/enemy distinction, and thus for the ad- 
versary claim that every political collectivity must be 
constituted antagonistically in relation to an other 
(whether external or domestic), is Hobbesian: Schmitt 
(1996) appeals to the ineradicable possibility of exis- 
tential struggle to the death, “the real possibility of 
physical killing” (33). This possibility of killing nec- 
essarily insinuates itself into the very nature of polit- 
ically constituted collectivities: the possibility of war 
constitutes a collective political entity essentially as a 
combative collectivity, a “kdmpfende Gesamtheit von 
Menschen.” 

The question is why one possibility among others— 
and a possibility is always only one among 
others—should be determinative for political relations. 
Jacques Derrida (1994, 105-6) has argued that in seeing 
the possibility of an event as essentially constitutive of 
the political, Schmitt is forced to portray the possibility 
as an event itself Schmitt’s language constantly betrays 
this “slide from possibility to eventuality...and from 
eventuality to effectivity”: 


As soon as war is possible, it 1s presently in process, 
he seems to say . whether it be declared or not... war 
takes place, it has already begun before beginning, as 
soon as it is held for eventual....And it 1s eventual 
as soon as it is possible. . Here he invokes eventual- 
ity (wenigstens eventuell), there he invokes possibility 
(Moglichkeit).. . As soon as war 18 possible/eventual, the 
enemy is present, he 1s there, his possibility is presently, 
effectively supposed and structuring. 


It might be replied, on Schmitt’s behalf, that not all 
possibilities are equal, that some possibilities do in- 
deed structure reality, and that killing is one of these 
structuring possibilities. We can get a sense of the kind 


Ð Schmitt (1987) wmtes, for instance, that “the enemy exists 
only with an at least potentially—i.e. the real possibility of a— 
fighting collectivity of people confronting a similar collectrvity” 
[Ferd ist nur eine wenigstens eventuell, d h. der realen Moglichkeit 
nach kampfende Gesamthen von Menschen, die ener ebensolchen 
Gesamtheit gegenubersteht] (29, my translation) The published 
English translation (“An enemy exists only when, at least potentially, 
one fighting collectivity of people confronts a simular collectivity” 
[Schmitt 1996, 28]) 1s more elegant, but by collapsing the wenigstens 
eventuell, d.h der realen Moglichkeit into “at least potentally,” it 
misses the nuances in the onginal relevant to Derrida’s argument. 
As Derrida (1994, 106) points out, Schmutt’s French translator fails 
to mark the difference between wenigstens eventuell and Moglichkeit 
as well 
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of argument required to transform mere possibility 
into effectivity by returning to the Hobbesian roots 
of Schmitt’s argument. In order to show that the pos- 
sibility of killing necessarily leads to war in the state 
of nature, Hobbes’s ([1651] 1996) argument requires at 
least three premises. (a) First, Hobbes assumes that by 
nature human beings’ overwhelming motive for action 
is self-preservation. (b) Second, Hobbes assumes the 
truth of epistemological skepticism vis-a-vis the exter- 
nal world. The consequent epistemological uncertainty 
implies that, in the state of nature, we simply cannot 
know whether the other whom we confront is benign or 
a threat to our survival (and the latter is always a possi- 
bility). (c) Third, Hobbes structurally defines the state 
of nature as a state where there is no overriding com- 
mon sovereign power who can guarantee one’s security 
against potential aggressors (through the enforcement 
of common rules). Hobbes’s conclusion is that (a) the 
motive for self-preservation, combined with (b) our 
epistemological weakness and (c) structural lack of 
sovereign power, renders each of us “diffident” and 
thus prone to engage in preemptive strikes. This, then, 
is how the possibility of killing infallibly transforms 
itself into a reality for Hobbes. 

The problem for Schmitt is twofold. First, the con- 
clusion does not quite follow from the premises: when 
confronted with an other being, my epistemological 
weakness in fact faces three logical possibilities, not 
two. The other may be benign, or a threat to my sur- 
vival, but for all I know the other might also be a willing 
and cooperative aid to my survival—indeed, his or her 
benevolence may turn out to be indispensable to it. So 
it is not clear why epistemological uncertainty should 
not lead to hope, rather than diffidence. This is pre- 
sumably why Hobbes adds another assumption about 
human nature to get humans on a war footing: he needs 
to assert that (d) by nature humans seek glory. The 
second problem for Schmitt is that once we abandon 
any of the four premises, Hobbes’s conclusion does not 
follow. To say nothing of Hobbes’s assumptions about 
human nature, and the skepticism that Schmitt does 
not share, the structural assumption (c) that Hobbes 
needs to make demonstrates the way in which the ef- 
fectiveness or realization of a possibility is dependent 
on contingent features of empirical reality. 

In other words, in order to construe the possibility of 
killing as an eventuality, if not inevitability, we need to 
engage in two reifications. First, we must explain human 
behavior in terms of question-begging assertions about 
human nature. Such assertions, if themselves based 
on observations of human behavior or psychology, 
are question-begging because, as Rousseau famously 
pointed out in his second Discours, the characteristics 
we attribute to human nature may be the effect, not 


3 Hobbes ([1651] 1996) gives a succinct statement of his argument, 
which moves from the possibility of killing to the incentives for 
preemptive stnkes and the inevitability of the state of war m the 
state of nature (87-89). On Hobbes’s motivational assumption of 
self-preservation (as opposed to the more widely attributed motive 
of self-interest) and on his epistemological skeptiasm, see Tuck 1989 
and Tuck's introduction in Hobbes (1651) 1996 
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the cause, of sociopolitical phenomena.*! Second, we 
would need to take for granted precisely those contin- 
gent structural features of reality that should be the 
object of empirical investigation. By transforming a 
contingent, empirical possibility into a metaphysically 
determinative necessity, Schmitt has simply reified the 
contingent causes of antagonism. Not only does such 
reification illegitimately constrain normative theory by 
confronting it with fake metaphysical barriers, it also 
hampers the development of realistic empirical theo- 
ries seeking to explain the contingent causes of antag- 
onism and war. The question of when and under what 
circumstances the possibility of existential struggle is 
transformed into actual compat GT even relations of 
antagonism, is an empirical one. 


MOUFFE: THE DERRIDEAN DIFFERENCE 
ARGUMENT 


Chantal Mouffe provides a more promising argument 
for the political version of the particularist thesis and 
adversary claim espoused by Schmitt. She argues that 
(a) “any social objectivity is constituted through acts 
of power” and (b) “has to show the traces of the acts of 
exclusion which govern its constitution” (Mouffe 2000, 
21). The point applies to a collective identity insofar 
as it has “objective” social existence, i.e., insofar as 
it manifests itself in social effects. Elsewhere Mouffe 
makes what is apparently a related point: “[a’] Politics 
aims at the creation of unity in a context of conflict 
and diversity; [b’] it is always concerned with the cre- 
ation of an ‘us’ by the determination of a ‘them”’ (101). 
The point seems to be that (a’) in the face of conflict, 
(a) collective identities are able to unify individuals 
only via the exercise of power. And (b) this exercise 
of power itself necessarily does “violence” to (some?) 
individuals, for such power can never be fully subject 
to rational justification. As such, from any given forged 
unity a remainder of exclusionary violence inevitably 
escapes: “the impossibility of establishing a rational 
consensus without exclusion” (45) follows from the 
fact that “difference is the condition of the possibil- 
ity of constituting unity and totality at the same time 
that it provides their essential limits” (33). The upshot 
is, apparently, that (b’) every collective identity simul- 
taneously and necessarily excludes some individuals 
from its constitution. 

I find the points that Mouffe makes in (a), (b), and 
(a’) quite plausible. The problem lies with (b’), the up- 
shot. I see two ways of reading this upshot. 

(A) The first way assumes that the “them” required 
in the construction of “us” refers (1) to actually exis- 
ting particular individuals who are (2) denied mem- 
bership. It assumes, in other words, that Mouffe is a 


31 That Schmitt reifies the possibility of existential struggle in terms 
of human nature is certain it 1s precisely this posstbulity of existential 
struggle to which Schmitt (1996) is referring when he asserts that 
“all genuine political theories presuppose man to be evil, ı e , by no 
means an unproblematc but dangerous and dynamic being” (61). 
2 For a similar argument applied to state behavior ın international 
relations, see Wendt 1992. 
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neo-Schmittian seeking to defend Schmitt’s version of 
the particularist thesis. 

That Mouffe does indeed subscribe to Schmitt’s po- 
sition here is clear. She explicitly follows Schmitt in 
claiming that antagonism “is inherent to all human re- 
lations,” that this antagonism is the defining feature of 
“the political” (Mouffe 2000, 101), and that the process 
of collective identity formation can only be understood 
10 light of this fact: 


In the domain of collective identifications, where what is 
in question is the creation of a ‘we’ by the delimitation 
of a ‘them’, the possibility always exists that this we/them 
relation will turn into a relation of the friend/enemy type. 
(Mouffe 1993, 2-3). 


While here Mouffe (1993) suggests that the enemy- 
relation 1s a possibility, elsewhere she claims that it is in 
fact a necessity: “to construct the ‘we’ it must be distin- 
guished from the ‘them’, and that means establishing 
a frontier, defining an ‘enemy”’ (69). Mouffe’s argu- 
ment, in other words, reproduces precisely the same 
equivocation we have already seen in Schmitt. And lest 
there be any doubt that the “enemy” whom Mouffe 
considers necessary for collective identity formation 
refers to concrete, actually existing human beings, she 
notes that “it is useful to remember with Carl Schmitt 
that the defining feature of politics is struggle and that 
“There always are concrete human groupings which 
fight other concrete human groups...” (113; quot- 
ing Schmitt 1985b, 67). She does, however, make one 
modification to Schmitt’s use of the term “enemy.” She 
distinguishes between two kinds of enemy: an enemy 
proper, whom one antagonistically seeks to destroy, 
and what she calls an “adversary,” which is an enemy 
whom one agonistically combats but whose existence 
is deemed legitimate (Mouffe 1993, 4). 


The novelty of democratic politics is not the overcoming 
of this us/them opposition—which is impossible—but the 
different way in which it is established... the aim of demo- 
cratic politics is to construct the ‘them’ ın such a way that 
it 18 no longer perceived as an enemy to be destroyed, but 
as an ‘adversary’. (Mouffe 2000, 101-2, my emphasis) 


The fact that an enemy can take the form of a legit- 
imate adversary makes democracy possible; but the 
fact that it is “impossible” to form collective identity 
except in relation to a concrete other qua enemy makes 
cosmopolitan democracy impossible: Schmitt “is right 
to say that a political democracy cannot be based on 
the generality of all mankind, and that it must belong 
to a specific people” (Mouffe 2000, 40). 

If (b’) is read in this light, then Mouffe is advancing 
the Schmittian particularist thesis: the enemy necessary 
for collective identity formation must be an external 
other consisting of concrete individuals excluded from 
membership. This would explain why democracy be- 
longs, as Mouffe says, to a “specific”—i.e., particular— 
people.” 


B Cf Mouffe 2000, 4: “As my discussion of Cari Schmitt's 
theses... makes clear, democratic logics always entail drawing a 
frontier between ‘us’ and ‘them’, those who belong to the ‘demos’ 
and those who are outside 1t.” 
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If this is the correct reading, then my point is simply 
that conclusion (b’) does not follow from Mouffe’s ar- 
gument. Some might, for example, be sympathetic to 
(a), (b), and (a’) for the reasons that Paul Ricoeur gives 
for what he calls the “tragedy of all human action”: 
beginning with the pluralist premise that there exist 
a plurality of incommensurable ends—which implies 
that “the historical realization of one value cannot be 
secured without harm to some other value”—he con- 
cludes that “in action, one must choose, hence prior- 
itize, hence exclude.” The upshot, for Ricoeur, is that 
every constitution involves exclusion, which consists 
in the compromise of some binding values in order to 
secure others ™% Now, one might argue in defense of 
Mouffe that “exclusion,” whether of the value-conflict 
sort Ricoeur has in mind or not, will invariably be tar- 
geted at particular individuals. And this may be true; 
but exclusion need not consist in the denial of member- 
ship to particular individuals, which is what the particu- 
larist thesis claims. The exclusionary violence to which 
Mouffe refers in (b) might, for example, take the form 
of including the targeted individuals as members by 
forcibly excluding (i.e., eliminating, suppressing, priva- 
tizing, reeducating, etc.) characteristics that fail to fit 
into the mold—the way that non—Parisians in France 
were included and assimilated into the French nation, 
for instance (Weber 1976). (Yes this occurred against 
the backdrop of “Spaniards” and others who were not 
members; my point is simply to illustrate what I mean 
by “exclusion” in nonmembership terms.) The Schmit- 
tian particularist thesis does not follow from Mouffe’s 
argument. 

(B) Of course it might be replied that if every forging 
of a unity mevitably produces exclusions, individual 
targets of exclusions are effectively denied full mem- 
bership, even if formally included: some ostensible 
“members” invariably must be incarcerated, socially 
marginalized, or perhaps simply misrecognized. This 
glosses Schmitt’s Premise 1 (that every political entity 
presupposes an enemy) in terms of domestic oppres- 
sion, and it provides the second way of reading the 
upshot (b’) of Mouffe’s argument. While the first read- 
ing assumed that the enemy Mouffe deems necessary 
for collective identity formation must be an external 
other consisting of concrete individuals excluded from 
membership, the second reading assumes that the en- 
emy can also be an enemy internal to the collective 
itself. We have already sighted this spectre of a domes- 
tic enemy in Schmitt’s work; on this reading, collective 
identity presupposes an other, but it is an other in the 
expansive sense of the term. 

What is common to both the strict and the expansive 
readings of Schmitt and Mouffe, however, is that the 
enemy is supposed to consist in an empirically specifi- 
able group(s) of actually existing concrete individuals. 
And therein lies the problem. We can begin to see 


4 “Dans l'action, ıl faut chor, donc préférer, donc exclure . 
chaque constitution exprime une échelle de pnontés uréductibles 
l'une à l’autre en vertu de raisons contingentes, tributaires d'une con- 
joncture géographique, historique, sociale et culturelle, non transpar- 
ente aux agents politiques du moment” (Ricoeur 1991, 169-70) 
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the problem by turning to Mouffe’s more explicit ar- 
gument for the adversary claim and the necessity of 
the other-relation (whether in the strict or expansive 
sense). Mouffe seeks to defend her Schmittian conclu- 
sions by appeal to an intellectual tradition whose point 
of departure is not Schmitt, but Derrida’s engagement 
with Saussure and Nietzsche: she appeals to the related 
Derridean categories of “difference” and the “consti- 
tutive outside.” 

Derrida’s (1967, chap. 2) notion of the constitutive 
outside emerges in the context of his critique of struc- 
turalist linguistics Mouffe has sought to generalize 
Derrida’s argument about language to identity forma- 
tion, and she is not the only one to read Derrida in this 
way (e.g., Wenman 2003, 60; cf. Laclau 1996). What is 
particular to Mouffe is the argument that “the notion 
of the ‘constitutive outside’, borrowed from Derrida, 
can help us elucidate” her explicitly Schmittian theses. 
“One of Derrida’s central ideas,” Mouffe (1993) claims, 


is that the constitution of an identity is always based on 
excluding something and establishing a violent hierarchy 
between the resultant two poles... . This reveals that there 
is no identity that is... not constructed as difference, and 
that any social-obyectivity is constituted through acts of 
power. It means that any social objectivity is ultimately 
political and has to show traces of the exclusion which 
governs its constitution. (141) 


Thus the notion of the constitutive outside helps expli- 
cate 


the antagonism inherent in all objectivity and the centrality 
of the us/them distinction ın the constitution of collective 
political identities. This arses when this us/them relation, 
which until then was only perceived as simple difference, 
began to be seen as one between friend and enemy. From 
that moment on ..it becomes political (in Schmutt’s sense 
of the term). (Mouffe 2000, 13) 


The constitutive outside is supposed to map onto the 
“them” that Mouffe thinks is constitutively necessary 
for a collective “we,” which in turn supposedly maps 
onto the Schmittian friend/enemy relation. The move 
from Derrida to Schmitt requires four steps: (1) identity 
presupposes difference, which means that difference 
is constitutive of identity; (2) therefore, any identity 
constitutively presupposes an other different from it; 
(3) this relation to an identity’s constitutive other is 
inherently characterized by antagonism; and (4) in the 
case of collective identity formation, the categories of 
“identity” and “other” map onto sets of actually exist- 
ing concrete individuals, viz., friend and enemy. I have 
already discussed Derrida’s critique of the conflation 
of possibility and eventuality (and the reification of the 
causes of antagonism) involved in the Schmittian ad- 
versary claim contained in step (3). Here I focus on step 
(4), which is wholly incompatible with the Derridean 
argument about difference. 

To see this, we need to attend to the specifics of 
Derrida’s argument. Derrida’s (1967, chap. 2) well- 
known thesis is that the “inside” of a supposedly closed 
conventional linguistic structure—what in Saussure’s 
structuralist linguistics ıs called a langue—presupposes 
an “outside” that both is necessary for the closure of the 
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linguistic structure and makes closure impossible. For 
Saussure (1960), a sign consists in the relation between 
a concept (signifié) and an acoustic image (signifiant). 
Saussure argued that the relation between a signifié 
and a signifiant with which it is associated in a linguistic 
structure is arbitrary and is wholly determined (as a 
matter of convention) by the sign’s position within the 
totality of the linguistic structure. Meaning, therefore, 
does not derive from the relation of an atomistic sign to 
a nonlinguistic entity (such as a mental state or external 
object); rather, a sign derives its meaning solely from 
its relation to (and difference from) all the other signs 
within the totality of the linguistic structure itself.” For 
structuralist linguistics, then, the wholeness of a langue, 
and its wholly closed character, is necessary to explain 
both how the sign relation can be arbitrary and how 
the communication of linguistic meaning is nonetheless 
possible. Derrida (1967) has argued that this closure of 
the linguistic system, on which structuralist meaning 
depends, conceptually and so constitutively requires 
drawing a boundary around the langue, i.e., the delimi- 
tation of what is interior from what is exterior to it (49- 
53, 65-66). The problem is, according to Derrida, that 
such closure, and the determination of an outside— 
both of which are necessary for meaning—would also 
make meaning impossible. Within such a closed sys- 
tem, since each sign must be defined in relation to 
the totality of other signs within the same linguistic 
structure, its meaning is subject to a vicious circular- 
ity: if the meaning of sign a is given by its differ- 
ence from b (etc.), and the meaning of b (etc.) is in 
turn given by its difference from a, then the mean- 
ing of a is ultimately dependent on the meaning of 
a itself, which is precisely what is in question. While 
the location of each sign within the structure of re- 
lations is fixed, its meaning would remain empty of 
content because ultimately self-referential Hence 
the need to refer to something outside of the linguis- 
tic structure. Thus the “outside” of a linguistic struc- 
ture is constitutive of a langue in two (contradictory) 
ways: first, this outside is required to constitute the 
boundaries that give closure to the totality of the lin- 
guistic structure; second, this outside must penetrate 
that structure in order for meaning to be possible in 
the face of a vicious, self-referential circle. This requires 
the signification of something outside of the putatively 
closed system of signification. That what is supposedly 
“outside” is actually constitutive of the “inside” in 
this way fatally calls into question, according to the 


35 As Talbot Taylor (1992, 85) notes, “As a system of signs langue 
consists in the matching not of indrvidual concepts with individual 
acoustic images, but of two structured domains: that of concepts dif- 
ferentially defined and that of acoustic images differentially defined ” 
Each signifié, in other words, derives its meaning from its difference 
with all other signufiés in the linguistic structure, and each signsfiant 
1s negatively defined by its contrast with all other signsfiants. 

36 As T. Taylor (1992, 166) puts it, on Dernda’s view, “In order to 
interpret a sign ..an interpreter must consider those ngnufients and 
sigrufiés to which that sign’s own signifiant and signifié are opposed, 
but even to identify those signufiants and signifiés she must consider 
the sigrufiants and signiiés to which they, ın turn, are opposed (and 
so on ad mfinitum).” 
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Derrida, the distinction between inside and outside, on 
which structuralist linguistics depends.” 

The upshot is that, even if Derrida’s argument about 
language maps onto an argument about collective iden- 
tity formation—even if collective identity constitu- 
tively presupposes difference—the Derridean category 
of the constitutive outside is not coterminous with 
an empirically specifiable set of concrete individuals. 
Even granting the Derridean argument, and even if 
“difference is the condition of the possibility of con- 
stituting unity,” as Mouffe (2000, 33) urges, it does not 
follow that collective political identities presuppose— 
or that unity can only be forged on the basis of—an 
“other” consisting of actually existing individuals. 
Derrida’s argument does not even support the weaker 
version of the particularist thesis, much less the 
stronger Schmittian one that adds the adversary claim. 
To derive Mouffe’s conclusion to the contrary is to 
confuse the Derridean language of difference with the 
Schmittian language of alterity. From the Derridean 
insight it does not follow that every “us” requires an 
actually existing “them” in Mouffe’s sense. 

So it is no surprise that Derrida himself attacks 
Schmitt’s friend/enemy argument on precisely this is- 
sue. The slide in Schmitt’s argument, from the possi- 
bility of the enemy-relation to its putative necessity, is 
important for Derrida not only because its exposure 
undermines Schmitt’s defence of the adversary claim 
(step (3) above), but also because to see the possibility 
of the event of war as an event itself requires a prior 
identification of who the enemy is, i.e., it presupposes 
that the enemy consists in an identifiable, recognizable, 
concrete set of individuals, a kdmpfende Gesamtheit 
von Menschen. As Derrida (1994) puts it, it is ab- 
solutely crucial for Schmitt that there be no doubts 
plaguing the knowledge of “who is the friend and who 
is the enemy” (136). And this takes us to the divided 
heart of the matter as Derrida sees it: the difference 
that constitutes identity constitutes identity and so pen- 
etrates it so that, ultimately, not only is difference in- 
ternal to the identity itself, but the distinction between 
what is internal to the identity and what is external to 
it is always breaking down.” And if the “constitutive 
outside” is constitutive of what is inside—if even con- 
ceptually there is no pure “inside”—then the (impure) 
conceptual distinction between friend and enemy (or 
between “us” and “them”) cannot map onto existing, 
empirically identifiable bounded groups of individuals 
except through ideological obfuscation. This is because 
any act of identifying the “friend” will inevitably find 


37 “Te système de l'écriture [which ıs the putative “outside” to 
a langue] en général n'est pas extérieur au système de la langue 
en général, sauf s Pon admet que le partage entre l’exténeur 
et Pintériour passe à |’inténeur de l'nténeur ou à |’exténeur de 
l’exténeur, au pomt que |’ummanence de la langue soit essentielle- 
ment exposée à l’intervention de forces en apparence étrangères à 
son système” (Derrida 1967, 63) 

8 This is a point that Mouffe herself wants to make as well; the 
problem is not that she fails to make it, but that she fails to see its 
incompatibility with her Schmittian conclusions. Her collaborator 
Ernesto Laclau (1996, chaps 2, 4) also makes the same point about 
the instability of the inside/outside distinction, but he does not invoke 
Schmitt 
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within its members characteristics that supposedly be- 
longed solely to the category of the “enemy.”?? “The 
contrary of the political inhabits and politicizes the 
political” (160). On Derrida’s deconstructive reading, 
then, the enemy is “other” only in the expansive sense 
that allows for the possibility of an other who inhabits 
the inside. Indeed, ultimately the enemy is “other” only 
in the radically expansive sense that breaks down the 
distinction between self and other: the enemy is within 
one’s own self, which is, as Derrida points out, the con- 
clusion to which Schmitt himself comes as a prisoner 
after World War I (187). This is why, Derrida claims, 
Schmitt begins by strictly defining the political in terms 
of antagonism against an external enemy, but ends up 
internalizing the friend—enemy relation, claiming that 
the lack of an external enemy would simply internal- 
ize war by directing it toward domestic enemies (140- 
42). 

One need not endorse either Derrida’s reading 
of Schmitt or his substantive philosophical argument 
about difference to see that his argument cannot be 
used to justify either the exclusion or the adversary 
claim. There are three distinct reasons why difference 
does not map onto an empirically identifiable concrete 
“other” consisting of actually existing persons excluded 
(whether formally or effectively) from membership.“! 
(1) First, even if identity presupposes difference, dif- 
ference can exist within the putative inside. (2) Second, 
difference within the putative inside actually calls into 
question the sharp distinction between inside and out- 
side. (3) Third, difference need not refer to actually 
existing persons at all: it can refer to nonhumans, it can 
refer to characteristics rather than individuals, and it 
need not refer to actually existing things at all. And 
the difference argument does not justify the adversary 
claim because (4) there are no metaphysical grounds 
for thinking that a relation of difference is necessar- 
ily a relation of antagonism (i.e., for thinking that the 
possibility of war is always effective and structuring). 

The result is that the difference argument cannot 
service Mouffe’s neo-Schmittian anticosmopolitanism, 


P Derrida (1994, 134) refers to “I’inaccessibilité de la frontière 
(le partage entre les concepts autant que le partage entre le 
dedans et Je dehors du corps politque...)” and concludes that “en 
pratique .cette différence entre les différends n’a jamais lieu On 
ne la trouve jamais Jamais concrètement Introuvable par conséquent 
demeure la pureté .de lennemi par laquelle Schmitt entend définir 
le poltique.. Aucun événement politique ne peut être correcte- 
ment décrit ou défini à l'aide de ces concepts.” The ensumg doubt 
about who 1s the fnend and who 1s the enemy generates, according to 
Derrida (136), “la récurrence compulsive et obsessionnelle du mot 
«concret»” in Schmitt’s text. 

© See Schmutt 1996, 51, quoted in footnote 27. 

41 Mouffe’s (2000, 12) own language, when she is consciously attend- 
ing to the Derridean current of her argument, actually betrays this: 
she refers, for example, to the “positivity” of the inside being “a 
function of the symbol of something exceeding it”—and a symbol, 
of course, is not the same thing as actually existing persons. (Schmitt 
(1996) himself ıs emphatic on this pomt. “The fnend and enemy con- 
cepts are to be understood ın their concrete and existential sense, 
not as metaphors or symbols” [27-28].) Persons can be symbols, 
to be sure, and such excluded persons often are the symbols that 
help constitute collective identities, but this 1s hardly a conceptual or 


metaphysical necessity. 
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on either the strict or the expansive reading: either she 
means to say that (A) democracy requires an external 
other qua enemy, in which case she has provided no 
support for the particularist thesis on which that claim 
is predicated, or she means to say that (B) democracy 
requires an enemy whether external or domestic, in 
which case her point is not specific to global democ- 
racy—to say nothing of the fact that the Schmittian de- 
fense of this adversary claim conflates possibility and 
effectivity. The problem of power—in particular, the 
exclusionary violence that the constitution of any unity 
leaves as a remainder—is not a problem specific to 
cosmopolitan democracy. If one can conceive of demo- 
cratic possibilities at the subglobal level—as Mouffe 
does, which is the point of her critique of Schmitt— 
then to bar the possibility of a global identity under- 
girding a democratically legitimated cosmopolitan or- 
der requires asserting the particularist thesis found in 
the strict reading (A). And the problem here is that— 
difference argument or not—the particularist thesis 
does not stand to reason: cosmopolitan solidarity or 
identity does not require an actually existing external 
other in contrast to which it must constitute itself 


CONCLUSION 


The particularist thesis has tempted not just theorists 
directly influenced by Schmitt, but also those under 
the spell of nationalist ideology: what Schmitt shares 
in common with nationalists is that both view political 
life in terms of the ideology of sovereignty. The upshot 
is that when political claims are made for collective 
identity groups, they are viewed, in light of the nation- 
alist model, as demands for sovereignty in the modern 
ideological sense, a sense whose imaginary demands an 
external other. 

But theorists who ground their anticosmopolitanism 
in the widely held belief that collective identity inher- 
ently presupposes an external other are quite mistaken 
to think that their position rests on a self-evident con- 
ceptual or metaphysical truth. Neither the recognition 
argument, the dialogic argument, the political enemy 
argument, nor the difference argument provides any 
support for the particularist thesis. The political enemy 
argument fails because the argument itself is unsub- 
stantiated: it either depends on definitional fiat, and 
so begs the question, or it depends on problematic 
metaphysical assumptions about the effectivity of the 
possible that reify the causes of antagonism. The other 
three arguments rest on claims that are more plausible: 
identity may indeed presuppose recognition, dialogue, 
or difference. But, contrary to what a cursory examina- 
tion might suggest, none of these three claims grounds 
the particularist thesis: the sources of recognition and 
dialogue required in the formation of a collective iden- 
tity need not be humans excluded from membership; 
and difference need not refer to actually existing indi- 
viduals excluded from membership either.” 


© The three arguments are not necessarily mutually compatible. The 
Dernidean difference argument calls into question Hegel’s metaphys- 
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While particularists who appeal to the recognition 
and dialogic arguments are guilty of a fallacy of com- 
position, those who ground the particularist thesis by 
appeal to “difference” miss the fact that difference and 
otherness can be constructed not just spatially, as it 
were, but also imaginatively and temporally (cf. Wendt 
2003, 527). We can see this oversight in Chris Brown’s 
(2001, 131) argument: 


Identity is about difference. There is no reason why a Eu- 
ropean identity could not gradually supersede British or 
French or Portuguese identities, but there 1s every rea- 
son why a global identity could not supersede European 
or North American or Japanese identities. Such a global 
identity would have no borders, no frontiers, no sense of 
the Other. 


To be sure, a collective identity might be formed in 
contrast to, or even in combat with, an actually existing 
external other excluded from its membership. But it 
might also be constructed on the basis of difference 
from hypothetical values and the imagined collective 
identities centered on them, or on the basis of differ- 
ence from the values of a past historical identity from 
which one wishes to mark one’s distance. One way 
to construct European solidarity is to ground it in a 
Christian-based identity defined in sharp contrast to 
Europe’s Islamic frontiers; a second way is to construct 
European identity in terms of the political values and 
institutions of human rights, religious toleration, polit- 
ical freedom, democracy, and so on, that seem increas- 
ingly entrenched in Europe since the end of World 
War II. The contrast, in the second case, might be 
with Europe’s own history, and with a lingering past 
that it seeks to leave behind. If identity does indeed 
presuppose difference, the first mode of constitution 
is not open to cosmopolitan solidarity, but the second 
is: humanity’s own past provides a rich and terrifying 
repository in contrast to which cosmopolitan identity 
could constitute its “difference.” 

None of this is to deny the empirical fact of actual 
exclusion, antagonism, and conflict in our world. It is, 
indeed, an empirically observable phenomenon that 
collective identities often do constitute themselves via 
the exclusion of external others from membership.” 
Perhaps there are even empirical (psychological, socio- 
logical, economic, and political) reasons why collective 
identities, particularly political collective identities, are 
most easily formed in this way—at least in the world 


ical assumptions, about the transparent and holistic nature of self- 
consciousness, which undergird the Hegelian recognition argument. 
43 For these two nval myths of European identity and their political 
implications, see Riekmann 1997 

M Enksen (1995) provides empirical evidence for this phenomenon, 
even though, like many others, he thinks the empirical phenomenon 
manifests a deeper conceptual necessity. “It is an obvious fact,” his 
article begins, “that every community is defined ın relation to [a] 
that which it is not; that is, [b] outsiders, aliens, non-members of 
the group. Social identities and groups are by default [a] relational 
in the sense that they are defined [b] ın relation to other identities, 
other groups” (427) Notice the illicit slide from (a) the assertion that 
identity requires contrast or difference to (b) the assertion that this 
contrast must be with a concrete other. Only the second assertion 
amounts to the particularist thesis 
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as we know it (Mercer 1995). I have already suggested 
one reason for this: the fact that contemporary politics 
is dominated by the modern ideology of sovereignty. 
But one must be careful not to redescribe the empiri- 
cal phenomena in terms of conceptual or metaphysical 
necessity. Chantal Mouffe’s work is perhaps the most 
robust attempt by a political theorist to collapse the 
empirical and conceptual/metaphysical in this regard, 
and it is in part successful, at least on the surface, be- 
cause of existing phenomena that make such a descrip- 
tion of political dynamics seem accurate. The problem 
with such a misdescription, however, is that its only 
function is to reify the politically generated dynamics 
that, by reference to other possibilities, it is the task of 
normative political theory to call into question. There 
is quite literally a world of difference between Miller’s 
(1989a, 67-68) claim that “communities just are partic- 
ularistic,” as if we were confronted with a conceptual 
or metaphysical truth, and his more careful assessment 
elsewhere that communities tend to be particular for 
empirical reasons.* While a merely empirical observa- 
tion that collective identities have usually been formed 
contrastively would certainly raise difficulties for cos- 
mopolitanism, those difficulties might turn out to be 
dependent on contingent (say, structural) features of 
the current sociopolitical order—and hence in princi- 
ple surmountable. The falsity of the particularist thesis 
is crucial here because the anticosmopolitan thesis that 
a democratic order centered on a global human identity 
is impossible depends on a feasibility limit that only 
such a categorical conceptual or metaphysical claim 
about the inherent nature of collective identity pro- 
vides. 

So if applying the Hegelian, Schmittian, or nation- 
alist paradigms to the process of collective identity 
formation inevitably leads to a conceptual or meta- 
physical misdescription of the empirical phenomena, 
then overcoming such reification may require rethink- 
ing these processes within a different theoretical appa- 
ratus. Only then might it be possible to better under- 
stand under what particular sociopolitical conditions 
collective identities are likely to be formed through 
opposition to an external other and, by contrast, under 
what conditions internally generated collective identi- 
ties are formed. Thus I see a complementary division 
of labor between political theorists and philosophers, 
on the one hand, and empirical social scientists, on the 
other. It is the primary task of the former to clarify, 
in light of the empirical evidence, the theoretical pos- 
sibilities and systematically to assess their desirability; 
it is the primary task of the latter to investigate the 
empirical conditions under which the various possi- 
bilities might be realized.“ Reifying the Westphalian 





45 “It would be too strong to say that it 1s necessary to the very idea 
of community that there must be outside communities in compet- 
tion, so to speak, with the one with which I identify, Nevertheless, 
there seems to be a well-supported empirical connection between 
the strength of people’s attachment to their own community and 
therr awareness of rival collectivities” (Miller 1989b, 232-33). 

46 For a good example of the latter, see Fearon and Laitin 2000, 
who provide an overview and synthesis of some recent literature on 
ethnic identity formation and violence. 
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order in conceptual or metaphysical terms, to the detri- 
ment of the future possibility of a democratically legit- 
imated cosmopolitan political order, is not a gift that 
a critical political theorist should just hand over to the 
nation-statist status quo. It certainly does not need the 
help. 
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Insider—Outsider Politics in Industrialized Democracies: 
The Challenge to Social Democratic Parties 


DAVID RUEDA University of Oxford 


n much of the political economy literature, social democratic governments are assumed to defend 

the interests of labor. The main thrust of this article is that labor is divided into those with secure 

employment (insiders) and those without (outsiders). I argue that the goals of social democratic 
parties are often best served by pursuing policies that benefit insiders while ignoring the interests of 
outsiders. I analyze Eurobarometer data and annual macrodata from 16 OECD countries from 1973 to 
1995. I explore the question of whether strategies prevalent in the golden age of social democracy have 
been neglected and Left parties have abandoned the goal of providing equality and security to the most 
vulnerable sectors of the labor market. By combining research on political economy, institutions, and 
political behavior, my analysis demonstrates that insider-outsider politics are fundamental to a fuller 
explanation of government partisanship, policy-making, and social democracy since the 1970s. 


that social democratic parties are the defend- 

ers of labor. The persistence of widespread un- 

employment witnessed under social democratic 
governments since the early 1970s, however, power- 
fully conflicts with this assumption. Moreover, the pop- 
ular press has reported with increasing frequency that 
the distinctiveness of some of the economic policies 
once championed by social democratic and conser- 
vative parties has been lost. How can these seeming 
anomalies be explained? The answer, I argue, lies in 
challenging the notion that social democratic govern- 
ments represent the interests of labor. 

The traditional conception of social democratic 
policy-making rests on the assumption that labor is 
affected disproportionately by unemployment. But in 
the following pages I argue that labor is divided into 
two segments: those with secure employment (insiders) 
and those without (outsiders). Since the early 1970s, in- 
siders have become insulated from unemployment. Not 
only do they enjoy high levels of protection, they also 
benefit from the fact that outsiders act as a buffer bear- 
ing the brunt of fluctuations in the business cycle. In 
response to the increasingly significant differences be- 
tween insiders and outsiders, social democratic govern- 
ments have transformed their policy goals. The anoma- 
lies mentioned above can be explained by considering 
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that the objectives of social democratic governments 
are best served by pursuing policies that ignore the 
interests of outsiders. 

Disaggregating labor into insiders and outsiders pro- 
motes the exploration of three topics of importance to 
the comparative political economy literature. The first 
one has to do with the transformation in party strate- 
gies resulting from new voter demands in industrialized 
democracies. While the relevance of other factors (like 
lower economic growth, demographic or production 
changes, the emergence of post-Fordism, increasing 
internationalization, and competition from industrial- 
izing countries) has been recognized for some time, 
my analysis makes clear the significance of insider- 
outsider preferences as a determinant of government 
policy. The second topic is related to the very nature of 
social democracy. Our assumptions about the strategies 
of leftist parties have not changed substantially since 
the golden age of social democracy (when equality, so- 
cial protection, and economic growth were perceived 
as compatible). My analysis questions these assump- 
tions and provides a fuller understanding of the limi- 
tations and opportunities faced by social democrats 
in the post—oil crises era. The final topic concerns what 
the goals of social democracy should be. This arti- 
cle demonstrates that, in the presence of conflict be- 
tween different groups within labor, social democratic 
governments often do not promote the interests of 
the weakest members of society. The insider—outsider 
model opens the door to a debate about the desirability 
of this outcome. 


THE INSIDER-OUTSIDER PARTISANSHIP 
MODEL 


Like much of the literature that explores the rela- 
tionship between partisan government and economic 
policy, I understand political parties to have electoral 
objectives as well as commitments to ideology and to 
historically meaningful groups of voters. As Powell 
(1982) has pointed out, the existence of a relationship 
between “strong, continuing expectations about par- 
ties and the interests of social groups not only creates 
easily identifiable choices for citizens, it also makes it 
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easier for parties to seek out their probable support- 
ers and mobilize them at election time” (116). History 
and ideology, however, are not enough. Elections in- 
evitably revolve around issues (like employment pro- 
tection or labor market policies) because issues give 
political meaning to partisan attachments and social 
divisions (Dalton 2002, 195). 

Unlike most of the comparative political economy 
literature, I do not conceptualize labor as a homoge- 
neous political actor. I share an interest in disaggregat- 
ing labor with some recent works on the determinants 
of party strategies and individual policy preferences 
(e.g., Iversen and Soskice 2001 and Kitschelt 1994, 
1999). My analysis is based on two propositions: that 
labor is divided into insiders and outsiders and that the 
interests of insiders and outsiders are fundamentally 
different. 

I define insiders as those workers with highly pro- 
tected jobs. They are sufficiently protected not to feel 
greatly threatened by high levels of unemployment. 
Outsiders, on the other hand, are either unemployed or 
hold jobs characterized by low salaries and low levels of 
protection, employment rights, benefits, and social se- 
curity privileges. The interests of these two groups are 
fundamentally different because insiders care about 
their own job security much more than about the un- 
employment of outsiders and outsiders care about un- 
employment and job precariousness much more than 
about the employment protection of insiders. 

While dividing labor into insiders and outsiders has 
some precedents in both the economics and the po- 
litical science literature, integrating this division into a 
coherent conception of partisanship and policy-making 
represents a completely new endeavor. It is my con- 
tention that social democratic parties have strong in- 
centives to consider insiders their core constituency. 
There are historical and ideological reasons for this 
but there is also the fact that the other group within 
labor, outsiders, tends to be less politically active and 
electorally relevant (as well as less economically in- 
dependent) than insiders. I further argue that social 
democratic governments will side with their core con- 
stituency when faced with the choice between insiders 
and outsiders. Insiders are benefited by higher levels 
of employment protection legislation while, inasmuch 
as lower protection facilitates hiring, outsiders are 
not. Consequently, the main policy objective of social 
democratic parties will be the continuation or increase 
of insider job security. Higher levels of labor market 
policy, on the other hand, benefit outsiders, but not 
insiders. Both active labor market policies (ALMPs) 
and passive labor market policies (PLMPs) have the 
potential to benefit insiders, but more directly, they 
mean higher taxes and low-wage competition (more 
on this below). The implication of my insider—outsider 


1 There are two frameworks that inspire the model that I propose: 
work on dual labor markets (such as Berger and Piore 1980 and 
Doernger and Piore 1971) and the economic inmder—outsider ap- 
proach emphamzing the differences between the employed and 
the unemployed (see, for example, Blanchard and Summers 1986, 
Lindbeck and Snower 1988, and Saint-Paul 1996). 
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model therefore is that social democratic government is 
associated with higher levels of employment protection 
legislation but not of ALMPs or PLMPs. 

Dividing labor into insiders and outsiders also has 
implications for the strategies of conservative gov- 
ernments. Like many other authors, I consider con- 
servative parties to depend on a core constituency 
that consists of upscale groups (employers, the upper 
middle-class, and the business and financial commu- 
nity). Paradoxically, the insider—-outsider model implies 
that, in some cases, conservative governments may be 
able to pursue labor market policies that are more 
attractive to outsiders than those promoted by social 
democrats. Having the upscale groups as their core 
constituency makes it difficult for conservative parties 
to promote the interests of insiders. But ignoring in- 
siders allows conservatives to engage in some policies 
unavailable to social democrats. As mentioned above, 
outsiders favor lower levels of insider job protection 
legislation. By reducing insider job protection, conser- 
vative parties may attract some outsiders while rein- 
forcing the support of their core constituency (upscale 
groups who want flexible hiring and firing). While lower 
employment protection is favored by both outsiders 
and members of the upscale groups, this is not the case 
regarding labor market policies. Higher levels of ac- 
tive and passive labor market policies represent higher 
taxes and a more intrusive role for government in the 
economy. Because of this, upscale groups (and there- 
fore conservative governments) are not interested in 
the promotion of labor market policies. 

My model predicts, then, the following partisan dif- 
ferences regarding economic policies: (1) partisanship 
will significantly affect pro-insider policies—I expect 
social democratic governments to be associated with 
higher levels (and conservative ones with lower levels) 
of employment protection; and (2) partisanship will 
not significantly affect pro-outsider labor market poli- 
cies (neither social democratic nor conservative gov- 
ernments will promote ALMPs and PLMPs). 

It is important to emphasize that the existence of 
two distinct groups within labor only affects the strate- 
gies of partisan governments when there is a con- 
flict between insiders and outsiders. The coincidence 
of insider and outsider goals is possible in some pol- 
icy areas. For instance, some parts of the welfare 
state (like health care or education) may benefit in- 
siders and outsiders equally. Insofar as both insiders 
and outsiders need these welfare services, they will 
join in support of governments that promote them. 
In this respect, the insider-outsider model does not 
modify the conventional assessment of social demo- 
cratic governments as prolabor (insiders plus out- 
siders). The arguments presented in this paper, how- 
ever, integrate this conventional assessment into a 
more general and more accurate explanation of the 
political factors influencing partisan government. Put 
more boldly, although both frameworks predict similar 
outcomes for some social policies, only the insider— 
outsider model provides an explanation of the rea- 
sons why partisanship matters to some policies but not 
others. 
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INSIDERS, OUTSIDERS, AND THE 
TRADITIONAL VIEW OF PARTISANSHIP 


The main approach to the relationship between polit- 
ical parties and policy in comparative political econ- 
omy can be categorized as the traditional partisan- 
ship school. Its authors—Alt (1985) and Hibbs (1977) 
being the most cited examples—believe that social 
democratic governments will promote the interests of 
labor, while conservative ones will satisfy the demands 
of upscale groups. Labor is assumed to be dispropor- 
tionately affected by unemployment, and as a conse- 
quence, social democratic governments are expected to 
design economic policies that promote employment. 
Inflation is assumed to disproportionately influence 
upscale groups so conservative governments are ex- 
pected to promote policies that reduce price increases. 

The model proposed in this article should be con- 
sidered a transformation of the traditional partisan 
approach. I agree with the traditional partisanship au- 
thors in considering parties to have economic goals 
fundamentally related to those of their core constituen- 
cies. My insider—outsider analysis departs from their 
framework in its identification of the electorates that 
parties are interested in attracting. The difference re- 
sults from a disagreement about how unemployment 
affects labor. 

It is highly misleading to categorize all labor as vul- 
nerable to unemployment. Since the early 1970s, two 
factors have substantially decreased unemployment’s 
threat to labor. First, because of the growth and sta- 
bility of the late 1960s as well as the social unrest 
and union activism that characterized the early 1970s, 
firms accepted highly restrictive tenure and severance 
pay arrangements (see Bentolila and Bertola 1990 and 
Blanchard et al. 1986). As a consequence, a consid- 
erable proportion of labor became significantly insu- 
lated from unemployment. Second, as “insiderness” 
emerged, so did “outsiderness.” The unemployment 
growth experienced by many OECD countries dur- 
ing the post—Oil Shock crises contributed to the in- 
crease in outsiders. But labor supply shocks caused by 
larger numbers of women entering the labor force and 
a general intensification of international competition 
and working time flexibility’ are also part of the story. 
One of the consequences of these developments is a 
dramatic increase in part-time work and temporary 
contracts. The great majority of these jobs, however, 
pay poorly, are concentrated in low-skilled activities, 
and possess minimal rights and benefits. More impor- 
tantly, the precariously employed and the unemployed 
are the main group to suffer the consequences of eco- 
nomic fluctuations (being hired in good times and laid 
off in downturns). 

To assume that unemployment disproportionately 
harms labor as a whole and that social democratic 
governments therefore need to focus on the problem 
of unemployment is clearly inaccurate when analyzing 
the post-1973 period. 


2 See, for example, Dore 1994. 


INDIVIDUAL PREFERENCES IN THE 
INSIDER—-OUTSIDER MODEL 


My argument is fundamentally concerned with the rela- 
tionship between government partisanship and policy, 
and consequently, this is the focus of the quantitative 
exploration I develop in the next section. First, how- 
ever, I provide some survey data that preliminarily sup- 
port my expectations about the preferences of insiders, 
outsiders, and upscale groups. I should emphasize the 
illustrative nature of this section’s analysis. I am not 
presenting a systematic test but rather some initial ev- 
idence to demonstrate the plausibility of my model’s 
assumptions about individual preferences. 

The importance of a permanent job in my definition 
of insiders (as opposed to fixed-term or temporary con- 
tracts) and the need for questions related both to policy 
preferences and to labor market status limit the data 
I use to one survey: Eurobarometer 44.3 (February-— 
April 1996). Although this Eurobarometer survey 
provides only a snapshot of individual preferences, it 
does allow me to develop insider—outsider codings that 
closely address my claims. 

I define insiders as employed full-time with a perma- 
nent job or as those with part-time or fixed-term jobs 
who do not want a full-time or permanent job. This 
group includes individuals with permanent contracts 
(defined as not having a time limit). Outsiders are then 
defined as those who are unemployed, employed full- 
time in fixed-term and temporary jobs (unless they do 
not want a permanent job), employed part time (un- 
less they do not want a full-time job), and studying. 
Students are included in the outsider category both 
because they have no certainties about their future 
employment (even those who hope to become insid- 
ers or upscale managers can end up unemployed) and 
because in some cases they may have extended their 
education because of difficulties entering the labor 
market.’ The upscale group category, finally, contains 
those individuals who are self-employed (profession- 
als, owners of shops, business owners, and managers) 
as well as employed managers. 

The insider—outsider partisanship approach rests on 
some assumptions about the preferences of insiders, 
outsiders, and upscale groups. These preferences are 
summarized in Figure 1. 

There are two dimensions represented in Figure 1: 
labor market policy and employment protection. I have 
placed the three groups on these dimensions accord- 
ing to the preferences specified in the model. On the 
first dimension, insiders (who enjoy a high degree of 
job protection) are considered to be less affected by 


3 A more systematic analymıs can be found in Rueda 2001 and Rueda, 
forthcoming and is the focus of ongoing research. 

4 The sample mcludes the following countnes. Austria, Belgium, 
Denmark, Finland, France, West and East Germany, Great Bntain, 
Greece, Ireland, Italy, Luxembourg, the Netherlands, Northern 
Ireland, Norway, Portugal, Spain, and Sweden. 

5 In Rueda 2001, I analyze whether the preferences of groups within 
the three categories (msiders, outsiders, and upscale) are homo- 
geneous My results show that students share the preferences of 
outsiders. 
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unemployment and less interested in dedicating more 
resources to labor market policies. Outsiders are most 
vulnerable to unemployment and therefore more con- 
cerned about active and passive labor market policies. 
As for the upscale groups’ preferences their position is 
justified by their desire to reduce the taxes that pay for 
these policies and a general inclination to limit the role 
of government in the economy. 

There are reasons for insiders to favor higher levels 
of labor market policy. Insiders face some probability 
of losing their jobs (when companies become econom- 
ically unviable, for example) and labor market policies 
can reduce the intensity of job searches by outsiders 
and therefore reduce competition for wages. But the 
reasons for insiders to oppose labor market policies 
are more powerful. An increase in the levels of active 
or passive labor market policies, after all, represents a 
higher tax burden for insiders. Additionally, some of 
these policies may, if successful, promote the entry into 
employment of individuals who can underbid insiders’ 
wage demands.° As Saint-Paul (1998) has argued, when 
insiders feel protected enough not to significantly fear 
unemployment, lack of support for ALMPs may result 
from the insiders’ interest in being sheltered from low 
wage competition (162).’ 

On the second dimension, insiders are expected to 
be strongly in favor of employment protection, while 
upscale groups and outsiders are expected to place 


® See Calmfors 1994 and Saint Paul 1998 For the relationship be- 
tween the effects of actrve and passive labor market policies, see 
Calmfors 1993 For an anatysis showing that the effects of ALMPs on 
labor market competition may be dependent on whether they target 
particular mdividuals, see Calmfors and Lang 1995. Regarding the 
latter point, the implication for my analysis is related to one of the 
starting assumptions: that ALMPs target outsiders 

7 Tt is clear that the emergence of employment protection described 
in the previous section amplifies unsider—outsider differences The 
higher the level of protection, the more important the inmder- 
outsider conflict, as one group (mmnders) increasingly pays the costs of 
ALMPs and PLMPs while the other (outsiders) increasingly benefits 
from them. 
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themselves closer to the other side of the spectrum. 
It is clear that lowering employment protection leg- 
islation directly attacks the interests of insiders. The 
preferences of outsiders are justified by their belief that 
lower employment protection will facilitate their exit 
from unemployment and precarious employment. Ac- 
cording to many, employment protection “is, in effect, 
a tax on work-force adjustments” (OECD 1999, 68), 
and as such it may inhibit firms from shedding labor 
in economic downturns but also from hiring in perti- 
ods characterized by good performance. The upscale 
groups (especially employers and managers) clearly 
benefit from the flembility of lower levels of employ- 
ment protection. 

To test whether the preferences presented in 
Figure 1 are accurate I turn to the Eurobarometer sur- 
vey. Figures 2a and 2b reflect the responses of insiders, 
outsiders, and upscale groups to two questions that 
address active labor market concerns. In the first ques- 
tion, respondents were asked whether they would tend 
to agree or disagree with the following statement: “The 
government should offer a guarantee of training, or a 
job, to all young people leaving school.” Responses that 
agreed were given a 10 and those that disagreed were 
given a 0. The x-axis in Figure 2a measures the mean re- 
sponse of the three groups. As theorized, outsiders are 
strong advocates of guaranteed training and jobs for 
young people, while both insiders and upscale groups 
believe this is a much lower priority. The expected pref- 
erences are also confirmed when the question moves 
away from the relatively abstract issue of job and train- 
ing guarantees to a more concrete issue. Respondents 
were asked whether they would tend to agree or dis- 
agree with the following statement: “I would be ready 
to pay more tax if I were sure that it would be devoted to 
creating new jobs.” Again, agreements were given a 10 
and disagreements a 0. The x-axis in Figure 2b measures 
the mean response of the three groups. Although this 
time the scores are lower, the relative positions of the 
groups are almost identical. As expected, outsiders are 
clearly in favor of employment promotion measures 
even if they imply increasing taxes, while insiders and 
upscale groups display much lower levels of support. 

The numbers in Figures 2a and 2b are suggestive, but 
an initial assessment of their statistical significance can 
help confirm their meaningfulness. I estimate Pearson 
chi-square statistics to test whether a significant rela- 
honship exists between being an insider, outsider, or 
upscale individual and holding the specific opinions 
about employment promotion contained ın the figure. 
The null hypothesis in this test is that there is no asso- 
ciation. The results suggest (at better than a 99% level 
of confidence) that a relationship does exist between 
insider—outsider—upscale status and employment pro- 
motion preferences. This is the case for the numbers in 
both figures so the significance of the association does 
not depend on the question.® 


8 For a more systematic test of the implications of the msider— 
outsider model for employment promotion preferences, see Rueda, 
forthcomung. 
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The numbers in Figures 2a and 2b are a good repre- 
sentation of ALMP preferences but they do not reflect 
concerns about the levels of passive labor market poli- 
cies, This is because of the absence of any question in 
the survey addressing PLMPs. The questions asked are 
either too general (Is social welfare a necessity?) or 
related to areas excluded from PLMPs (Should health 
care or education be guaranteed?). We can turn, how- 
ever, to the analysis of PLMP preferences in Boeri, 
Boérsch-Supan, and Tabellini 2001, which shows that 
the preferences in my model are in fact reasonable. 
They find that the individuals I define as outsiders 
would be ready to accept higher costs (i.e., taxes) 
in return for more unemployment insurance. Insiders 
and upscale groups do not seem to share these pre- 
ferences. 





Turning now to the third dimension in my analy- 
sis, Figure 2c depicts the job security preferences of 
insiders, outsiders, and upscale groups. Respondents 
were asked the following question: “For you personally, 
how important do you think each of the following is in 
choosing ajob?” Respondents were then given several 
characteristics that they could rate from very important 
to not important at all. Responses that considered a se- 
cure job very important were given a 10 and those that 
did not were given a 0. Again, the mean preferences of 
the three groups confirm the hypothesis in Figure 1. As 
expected, insiders are most concerned about job secu- 
rity, while outsiders and upscale proups are much less 
likely to consider job security very important. Whether 
an individual is an insider, an outsider, or a member 
of the upscale groups does prove to be a statistically 
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significant determinant of his/her attitude toward job 
protection. A Pearson chi-square test shows that the 
association between these two variables is significant 
at better than the 99% level of confidence. 


THE EFFECTS OF PARTISANSHIP 
ON POLICY 


The data presented in the previous section illustrate 
that my partisanship model’s expectations about indi- 
vidual preferences are reasonable. In the next pages I 
explore whether parties do in fact develop policies in 
line with these preferences. I focus on two measures: 
labor market policies and employment protection leg- 
islation. 


The Dependent Variables 


Labor Market Policies. PLMPs provide unemploy- 
ment compensation, whereas active ones are aimed 
at reducing unemployment by shaping the supply, de- 
mand, and mobility of labor. The OECD data used in 
my statistical analysis include unemployment benefits 
as the main component of PLMPs. The ALMP measure 
encompasses the following five areas: V public em- 
ployment services and administration, (2) labor market 
training, (3) youth measures, (4) subsidized employ- 
ment, and (5) measures for the disabled. 

While PLMPs (as an important element of the wel- 
fare state) have received quite a lot of attention in 
the comparative political economy literature, the re- 
lationship between ALMPs and government partisan- 
ship has been underexamined. Economists have been 
concerned mostly about their effects on employment 
and have generally ignored the role of partisanship 
in promoting different levels of ALMP. In political 
science, some authors have explored the relationship 
between partisanship and ALMPs but some important 
questions have been left unanswered. The effects of 
divisions within labor, in particular, have not been ana- 
lyzed 1 in detail since the starting point for most political 
scientists is to consider ALMPs one more measure that 
social democratic parties will employ to benefit labor 
(see, e.g., Boix 1998 and Janoski 1994, 1990). 

As explained in more detail in the previous section, 
outsiders are considered the main beneficiaries of labor 
market policies in the model proposed in this article. 
Before the widespread adoption of employment pro- 
tection in the early 1970s, the interests of insiders and 
outsiders regarding labor market policy were closely 
aligned. Insider vulnerability to unemployment was 
higher and social democratic governments could pro- 
mote labor market policies that favored outsiders. The 
emergence of employment protection causes the inter- 
ests of insiders and outsiders to diverge. As ALMPs and 
PLMPs increasingly become policies that insiders pay 
the costs of while outsiders receive the benefits from, 
social democratic governments become less likely to 
promote them and partisanship becomes insignificant. 


Employment Protection Legislation. Employment 
protection legislation affects “the rules governing un- 
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fair dismissal, lay-offs for economic reasons, severance 
payments, minimum notice periods, administrative au- 
thorization for dismissals and prior discussion with la- 
bor representatives” (OECD 1994, 69). The previous 
section made clear the reasons behind the preferences 
of insiders, outsiders, and upscale groups. Because in- 
siders and upscale groups have opposing interests and 
they are the core constituencies of social democratic 
and conservative parties, the insider—-outsider model 
implies the existence of marked partisan difference re- 
garding job protection. 

I use two different measures of employment protec- 
tion legislation. The first one is the mean for the 1980s 
and the 1990s of the OECD's overall protection against 
dismissals index. The index is constructed by averaging 
the scores obtained by each country in three categories: 
“procedural inconveniences which the employer faces 
when trying to dismiss employees; notice and sever- 
ance pay provisions; and prevailing standards of and 
penalties for unfair dismissal” (OECD 1999, 54). Con- 
ceptually, this index is ideal for testing my hypotheses. 
It suffers, however, from the important practical limita- 
tion of being available only as a summary value for the 
1980s and the 1990s. I want to use yearly data that allow 
a significant increase in the number of observations 
and in the complexity of the estimated models. For this 
reason, I also use a measure of the number of months of 
severance pay a blue-collar worker with 10 years of ser- 
vice receives upon termination without cause.? Cause 
is illustratively explained by Lazear (1990) as generally 
meaning “for reasons having to do with the worker’s 
own shortcomings, and it must be extreme. A reading 
of the rules Suggests that in most countries, dismissal 
with cause requires the kind of evidence necessary to 
withdraw an American academic’s tenure” (708). 


The Explanatory Varlable: Government 
Partisanship 


The government partisanship measures used in my 
analysis attempt to capture the ideological position of 
governments in relation to a left-right continuum. Two 
variables are needed for the construction of these mea- 
sures: one that reflects the presence of parties in gov- 
ernment and another that measures their ideological 
characteristics. The operationalization of the first vari- 
able has been relatively straightforward in the compar- 
ative politics literature. But important questions sur- 
round the measurement of party ideological positions. 
Assessments of left-right party positions are based 
on two sources: expert opinions and party manifestos. 
These two measures imply a different set of complica- 
tions. Expert opinions are produced from surveys that 
are administered rarely and that may be interpreted 
differently in different national contexts (Gabel and 
Huber 2000). Data extracted from party manifestos, on 
the other hand, can be criticized for being a reflection 
of what parties say to win elections, and not necessarily 
of what they will do once they have won them. 


’ For an analysis of the close relationship between severance pay and 
overall employment protection, see OECD 1994. 
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In this paper, I try to avoid some of these compli- 
cations by using two measures of government parti- 
sanship. The main results reported below use parti- 
san cabinet composition as measured by Tom Cusack 
(1997).° The reason for this choice is mostly practical. 
Many analyses dealing with the effects of government 
partisanship on policy use this measure and I want 
my results to be easily compared to those obtained 
by other authors. After presenting results using cabi- 
net partisanship, I verify the conclusions of my initial 
analysis with a measure of government partisanship 
that uses party manifestos to assess a party’s left-right 
position.”’ For the construction of this government par- 
tisanship measure, a party’s average left-right position 
is multiplied by its cabinet weight. In this case, the 
cabinet weight is the proportion of parliamentary seats 
that parties in governments posses. This is similar to 
Cusack’s measure because governments tend to “ap- 
portion their cabinet portfolios to parties in simple 
proportion to the relative percentage of seats held by 
each in the lower house of the legislature” (Powell 2000, 
173). 

I would like to address a final point with refer- 
ence to the government partisanship measures used 
in the analysis. As mentioned above, the variables 
calculate the ideological position of governments in 
relation to the partisan composition of cabinets. This 
means that parties other than the social democratic and 
conservative ones influence the weighted partisanship 
measure.“ This, however, does not affect the insider— 
outsider partisanship model or the findings described 
below. Regarding partisan options to the left of the 
social democrats, communist parties are strongly pro- 
insider. Most communist parties in Europe rely even 
more on the support of insiders (through both their 
votes and their participation in unions) than social 
democratic ones and they have been just as reluctant 
to integrate the interests of outsiders. The government 
participation of communist parties in the sample that I 
analyze is limited enough, in any case, not to affect the 
conclusions made about the influence of partisanship 
over policy. 

When considering Christian democratic parties (usu- 
ally placed in the moderate right), the implication of 
my analysis is that these parties promote policies that 
can be placed in between social democracy and conser- 
vatism for the dimensions I focus on. This seems real- 
istic. Liberal parties are placed either in the moderate 


10 Higher figures signify more conservative government. Cusack 
groups parties into five families, multiplies each family’s share of 
cabinet portfolios by its weight, and sums the products. See Cusack 
1997 for further details 

4 This variable relies on party programs for the codification of pol- 
icy emphases and it was produced by the Comparative Manifestos 
Project Source for all countries but Japan: McDonald and Mendes 
2001. Data for Japan were created by the author. Sources: Com- 
parative Manifestos Project left-right party index and Woldendorp, 
Keman, and Budge 2000. Given the variance of the ideology measure, 
the 1973-95 average 1s used ın the analysis. 

12 Although I refer to social democratic and conservative parties ın 
the Results, it would be more accurate to refer to the partisan options 
as left and night. 
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right or between the Christian democrats and the con- 
servatives. Considering the two policies emphasized in 
the previous pages, this seems uncontroversial. The 
free-market philosophy of liberals places them close 
to conservatives regarding employment policy and job 
protection. 


Other Varlables 


Labor Market Institutions. Both because of their di- 
rect involvement in industrial relations (negotiations 
covering work regulations and wages being the most 
clear examples) and because of their capacity to in- 
fluence political parties, the behavior of unions is a 
relevant factor in a government’s decisions over policy. 
I emphasize two labor-related factors: the centraliza- 
tion/coordination of wage bargaining and union den- 
sity." 

International and Financial Openness.“ There are 
two contradictory accounts of the effects of interna- 
tionalization on partisan politics. First, there is a large 
literature suggesting that growing levels of interna- 
tional openness and interdependence result in a blur- 
ring of partisan differences caused by the inability of 
social democratic parties to produce policies that do 
not conform to market forces (see, e.g., Iversen 1996 
and Scharpf 1991). Then there are some authors who ar- 
gue either that international forces do not affect some 
partisan differences (like Boix 1998 and Garrett and 
Lange 1991) or that they actually have strengthened 
the influence of partisanship on policies and economic 
outcomes (Garrett 1998). The results presented in the 
following pages do not address whether international 
dependence limits the autonomy of governments, In- 
stead I look at the great variance of economic policy 
within the sample and try to assess the factors that 
are responsible for it. My hypothesis is that, once the 
influence of internationalization is controlled for, par- 
tisanship will account for this variance only in the case 
of employment protection policies. 


Government Debt. I introduce government debt 
into the analysis as a measure of the availability of 


D The differences between liberals and conservatives (in individ- 
ual freedom issues such as abortion and divorce) pertain to policy 
dimensions not discussed in this paper 
14 See Iversen 1999 for a complete specification of the centraliza- 
tion/coordination variable. To capture the inertia associated with 
institutional change, I use a moving average of the yearly values 
(present and previous four years) It should also be noted that values 
for the last two years in the tme senes were extrapolated. The union 
density measure used in the regressions represents employed union 
members as a percentage of the employed labor force. The pre-1990 
figures were taken from Visser 1996, post-1990 figures were provided 
7 Bernhard Ebbinghaus (Max-Planck Institute) 

International openness is measured as imports plus exports as 
a percentage of the GDP. Source: OECD electronic database and 
OECD Historical Statistics 1960-95. Financial openness 1s measured 
as the sum of several indexes for financial restrictions For details, 
see Armingeon, Beyeler, and Menegale 2002. 
16 Source: Franzese 1998 Given the possibility of endogeneity 
(higher levels of policy causing higher debt), I use a one-year lag 
for this variable 
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resources affecting a government’s choices. One widely 
accepted interpretation of the policy changes of the 
early 1980s, for example, is that many governments 
had reached unsustainable levels of public debt (see 
Schwartz 1994). I use the level of consolidated central 
government debt as a percentage of GDP as the indi- 
cator and include it only in the regressions with ALMP 
and PLMP as the dependent variables (since any re- 
lationship between resource limitations and legislation 
affecting employment protection seems unlikely). 


Unemployment.” Some authors have argued that 
policies simply result from increasing needs—whether 
demographic, economic, or other (see, e.g., Wilensky 
1975). I engage these arguments by controlling for the 
effects of unemployment. It is important that my results 
are not affected by needs. I want to be able to conclude, 
for example, that social democratic governments do 
not promote high levels of ALMPs regardless of the 
size of unemployment. Unemployment also acts as a 
proxy for the number of outsiders in an economy. It 
is essential for this article’s conclusions that the re- 
sults control for the size of the outsider group, since 
the insider—outsider partisanship model maintains that 
social democratic governments will appeal to insiders 
even when outsiders are numerous. 


GDP Growth. Most analyses of economic policy 
include a measure of economic growth. This is particu- 
larly important here because of the need to control for 
the effects of growth on the behavior of governments. 


METHODOLOGY 


I use annual data from 16 countries from 1973 to 1995 
and present ordinary least squares (OLS) results.’ The 
pooled data significantly increase the number of obser- 
vations and therefore allow me to test more complex 
causal models. I also include a lag of the dependent 
variable among the regressors. Since the chosen de- 
pendent variables exhibit noticeable time stability, the 
introduction of a lagged dependent vanable provides 
a better dynamic model in which the influence of the 
previous year’s values is explicitly assessed. ‘To further 
capture cyclical factors I include time period dummies 
in the regressions.” 

Beck and Katz (1995, 1996) have proposed a method 
that produces consistent standard errors estimates 
in the presence of panel heteroscedastic errors. Since 
their recommendations have been widely followed in 


17 The measure used 1s the standardized unemployment rate for all 
countnes but Austria, Denmark, and Switzerland. For these three 
countries, I use regular unemployment rates. Sources: OECD His- 
torical Statistics 1960-95 and 1960-97 Given the possibilty of endo- 
geneity, I use a one-year lag. 
8 GDP growth is measured as year-to-year tage changes. 
Source: OECD electronic database and OECD Historical Statistics 
1960-97. 
19 The countries are Australia, Austria, Belgium, Canada, Denmark, 
Finland, France, Germany, Italy, Japan, the Netherlands, Norway, 
Sweden, Switzerland, the United Kingdom, and the United States 
20 The periods are 1973-79, 1980-84, 1985-89, and 1991-95. The 
excluded reference year 1s 1990. 
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the recent comparative political economy literature, I 
estimate panel-corrected standard errors. 

I therefore regress the dependent variables on a 
one-year lag of the dependent variable, independent 
variables, and period dummies. That is, I estimate the 
following equation: 


Ya = Wit- + X Bitni +T F Uy, 
k 


where i refers to the cross-sectional units, t to the time 
units, k to the number of independent variables, r to 
the time period intercepts, £ to the slopes of the ex- 
planatory variables, y, ;_; to the lagged dependent vari- 
able, and the x’s are the independent variables. 

The only regression not run according to these spec- 
ifications is the one that uses two time averages of the 
protection index per country instead of yearly data. 
Given the small n, in this case I did not estimate panel- 
corrected standard errors and simply ran a bivariate 
OLS regression on the variable of interest. 


RESULTS 


Table 1 provides the estimates for the determinants 
of labor market policies. Here, it is most important to 
point out that, as hypothesized, cabinet partisanship is 
not significant as an influence on the levels of ALMP 
or PLMP. The table clearly shows that whether a gov- 
ernment is social democratic or conservative makes no 
difference to the levels of ALMPs promoted. It is also 
important to emphasize that these results contradict 
the conventional wisdom, and much of the existing 
literature, regarding the influence of partisanship on 
active policies (see, e.g.. Boix 1998, Janoski 1990, 1994, 
and Swank and Martin 2001). This article represents 
a considerable improvement on most of those done in 
the past because of the number of cases and factors 
included in the analysis.” 

Table 1 also reveals that government partisanship is 
an insignificant determinant of passive labor market 
policies. As hypothesized, social democratic govern- 
ment is not associated with greater levels of PLMP. 
These results reinforce the conclusions of the regres- 
sion analyzing ALMPs: when insiders do not share the 
goals of outsiders, social democratic governments do 
not promote pro-outsider policies. Other authors have 
observed results similar to those presented in Table 1. 
In a very direct fashion, they are confirmed by the anal- 
ysis in Moene and Wallerstein (2003).~ They analyze, 
among other things, the effects of Right government 


7l Most existing analyses rely on a very limited number of 
observations—Borx’s (1998) regressions, for example, range from 
18 to 21 observations (75-79), and Janoski’s (1994) from 36 to 38 
(70-78). This severely Irmits the possibility of systematically testing 
alternative hypotheses while smmultaneously controlling for other 
relevant factors (these regressions typically have no more than three 
variables), which in turn introduces great caveats mto the significance 
of the results. Others do not systematically assess the validity of their 
claims across countries and through time, which limits the general- 
ability of their conclusions (see, for example, the country-specific 
analysis in Janoski 1990 and 1994). 

2 And also by the analysis of the United Kingdom in King 1995. 
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TABLE 1. The Determinants of Labor Market 


Pollcy, 1980—95 


ALMPs PLMPs 


Constant 


Lagged dependent variable 


Cabinet Partsanship 
Union Denstty 
Bargalning Centrallzation 
Internatonal Openness 
Financial Openness 

Lag of Govemment Debt 


Lag of Standardized 
Unemployment Rate 


GDP Growth 


N 

R? .93 
Source. OECD Social Expenditures Database 2000, except 
Switzerland and Austria (only for PLMP), OECD Employment 
Outlook 
Note: ALMP and PLMP spending as % of GDP. Higher values 
of the Cabinet Partisanship vanable mean more conservative 
governments. Numbers In bold are estmated coefficients; num- 
bers In parentheses are thelr panel-corrected standard errors; 
numbers In italics are p-values from one-sided tests Pertod 
dummy estimates are not reported (avallable upon request) 





on the major categories of welfare state spending and 
do not find a partisan effect on an aggregate measure 
of ALMPs and unemployment benefits. 

In Table 2, I estimate some alternative models to 
confirm the results presented in Table 1 (only the vari- 
able of interest is reported). The first column repre- 
sents the same regressions used for the main results 
with the inclusion of country fixed effects. Following 
Hsiao (1986), I include country dummies to control 
for those influences that are country specific and that 
could affect the accurate estimation of the variables of 
interest (country-specific omitted variables).~ Cabinet 
partisanship is an insignificant determinant of ALMPs 
(as was the case with Table 1) but a significant variable 


D Dummies for all countries are present in the regressions with fixed 
effects. I ran these regressions without a constant. 
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influencing PLMPs.* However, as hypothesized, the 
sign of the coefficient indicates that social democratic 
government is not associated with greater levels of 
PLMP. The sign seems to suggest that Right govern- 
ment (rather than Left government) is in fact associ- 
ated with more generous PLMPs. 

A number of authors have argued that governments 
dominated by Christian democratic parties promote 
generous welfare state policies (see, e.g., Hicks and 
Swank 1992 and Huber and Stephens 2001). Since this 
could be the explanation for the results using cabi- 
net partisanship, I run a regression following the same 
specifications but with Christian democratic share of 
cabinet seats instead of government partisanship.» 
Christian democracy turns out to be insignificant as a 
determinant of PLMPs. This means that the insignif- 
icance of partisanship effects observed in Tables 1 
and 2 is not caused by ignoring Christian democratic 
effects. 

The second column in Table 2 substitutes the ex- 
pert opinion—based measure of partisanship for one 
that uses party manifesto data. While the results in 
Table 1 are confirmed regarding ALMPs, the use of this 
alternative measure results in a significant partisan- 
ship effect on PLMPs. Although the sign of the coef- 
ficient indicates that more social democratic govern- 
ments promote higher levels of passive labor market 
policies, the size of the coefficient (—.002) suggests 
that the substantial effect of this variable is extremely 
small. 

Some authors have argued that strong labor move- 
ments allow leftist governments to promote low un- 
employment policies (see Alvarez, Garrett, and Lange 
1991 and Garrett 1998). The lack of partisanship effects 
in Table 2 could result from misspecifying the rela- 
tionship between labor market institutions and gov- 
ernment partisanship. I therefore test the Left labor 
hypothesis in the third column in Table 2. I substitute 
cabinet partisanship for Garrett’s (1998) variable cap- 
turing the interaction between social democratic gov- 
ernment and the power of labor. Since Garrett’s Left 
labor power index is not available after 1990 (or for 
Switzerland and Australia), I recreate the index by 
adding together social democratic government, the 
level of centralization/coordination of wage bargain- 
ing, and union density. This is a good proxy for Garrett’s 
index since it is highly correlated to it (the correlation 
coefficient is .91). Following Garrett, interactions with 
international and financial openness are introduced, 
while union density and bargaining centralization are 


44 Nickell (1981) demonstrates that, with short panel data, OLS esti- 
mation of models with lagged dependent vanables and fixed effects 
produces biased coefficients I therefore check these fixed-cffects 
results by using a two-stage instrumental variable procedure similar 
to the one explained in Rueda and Pontusson 2000. The findings 
regarding ALMPs are confirmed but government partisanship loses 
significance as a determinant of PLMPs ın the instrumental variable 
model (confirming the main results in Table 1). 

Z Results not reported but available from the author. Source for 
Christian democratic share of cabinet seats’ Duane Swank, Political 
Science Department, Marquette University. 
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TABLE 2. The Determinants of Labor Market Pollcles, Alternative Models 
Cabinet Partisanship, 


Government Partisanship, 


Measured with 
Party Mantfesto Data 


221 


Note: See Table 1 for details about the regressions Higher values of the Cabinet Partsanship variable mean 


more conservative governments Numbers in bold are estimated coefficients; numbers In parentheses are thelr 
panel-corrected standard errors, numbers In Italics are p-values from one-sided Hests. All other estimates are not 


reported (available upon request). 


eliminated from the model. When analyzing both 
ALMPs and PLMPs, the Left labor power variable is 
insignificant (as expected). The openness interactions 
are also insignificant. It seems, therefore, that even 
when the influence of social democratic governments, 
wage bargaining, and union strength is put together, 
the main observations made about Table 1 stand. 

The analysis presented in Tables 1 and 2 follows 
the standard procedures of recent comparative politi- 
cal economy work relying on pooled time series data. 
The models, however, ignore the question of whether 
the impact of social democratic government is subject 
to a lag. The results presented in the previous pages 
are therefore vulnerable to the criticism that they do 
not show a significant government partisanship effect 
simply because they do not capture social democratic 
influences only discernible after some time. To address 
this issue, I run the regressions in Table 1, substituting 
yearly cabinet partisanship for a 2-, 5-, and a 10-year 
average. The main results hold and government parti- 
sanship was found to be an insignificant determinant 
of ALMPs and PLMPs no matter the lag. 

If the first two tables make clear the lack of a pro- 
outsider orientation in governments of the Left when 
examining the determinants of employment protection 
legislation, their pro-insider behavior seems equally 
obvious from the analysis in Table 3. Leaving aside 
the lagged dependent variable, only two factors are 
significant at better than the 95% level of significance: 
partisanship, international openness, and the previous 
year’s unemployment rate. As expected, social demo- 
cratic governments are significantly associated with 
higher severance pay. In other words, social democracy 
has promoted higher employment protection legisla- 
tion (or protected insiders) during the period of time 
under analysis. This is all the more meaningful when 
we remember that in many OECD countries unem- 
ployment increased dramatically during this period. 

The substantive significance of these findings is per- 
haps best illustrated by simulating the effects of a 
change in government partisanship while keeping the 
other variables constant. A change in the partisanship 
measure from a rightist government to a leftist one 
(from a 4 to a 2 in Cusack’s index) would be associated 
with an immediate increase in employment protection 


70 





TABLE 3. The Determinants of Employment 
Protection, 1973-95 


Constant 

Lagged dependent 
varlable 

Cablnet Partisanship 


Union Density 
Bargaining 
Centrallzation 


International 


Openness 


Financlal Openness 


Lag of Standardized 
Unemployment 


R? 


Severance Overall Employment 


32 
92 .14 


Note: Higher values of the Cabinet Partisanship vanable mean 
more conservative governments. Severance Pay Notes. The 
data are the correction of Lazear's figures by Addison, Grosso, 
and Telxealra, updated for the 1992-85 period using OECD 
(1999). See Lazear 1990 and Addison and Grosso 1997. Num- 
bers In bold are estimated coefficients; numbers in parenthe- 
ses are thelr panel-corrected standard errors; numbers fn Italics 
are p-values from one-sided Hests. Because of missing data, 
Canada 1s not included in these regressions. Estimates for pe- 
rod dummies are not reported (avaliable upon request). Overall 
Employment Protection Notas. Source: OECD (1999), Table 2.2, 
Panel B All entnes are bivanate OLS estimates. Numbers In 
bold are estimated coefficients, numbers in parentheses are 
their standard errors; numbers In talks are p-values from one- 


sided tests. 
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TABLE 4. The Determinants of Employment 
Protection, Alternative Models 
Cabinet Partisanship, Government Partisanship, 
Model with Country Measured with 
Fixed Effects Party Manrfesto Data 


—.076 000 
(-050) 
.066 


(.002) 

.464 

Note. See Table 3 for detalls about the regressions. Higher val 
ues of the Cabinet Partisanship variable mean more conserva- 
tve governments Numbers in bold are estimated coefficients: 
numbers In parentheses are their panel-corrected standard er- 
rors, numbers In italics are p-values from one-sided Łests. AJI 
other estimates are not reported (available upon request) 





equal to .148 of a month of severance pay. The long- 
term effect of this change in government partisanship 
would equal 4.111 months of severance pay.” 

I have complemented this analysis with a bivariate 
regression that takes the OECD’s overall employment 
protection index as its dependent variable. Since only 
figures corresponding to the “late 1980s” and “late 
1990s” exist, I regress overall employment protection 
on cabinet partisanship averages of the 1973-87 and 
1988-95 periods. The goal, given the limitations in 
terms of both the number of observations and the na- 
ture of the regressor, is simply to confirm the previous 
severance pay findings. And the results do just that. 
As in the first severance pay regression, social demo- 
cratic governments are strongly associated (surpassing 
the 99% significance level) with more protection for 
insiders. 

‘Table 4 tests the robustness of the employment pro- 
tection conclusions by exploring some alternatives. The 
first column indicates that a model with country fixed 
effects produces similar results to those in Table 3. 
Government partisanship is a significant influence over 
employment protection legislation when we control for 
country-specific omitted variables.” This is, however, 
not the case in the second column, when the measure of 
partisanship based on expert opinions is substituted for 
one that uses party manifesto data. It seems that when 
we look into what parties say (and not what they do), 
there is not a connection between social democratic 
government and protection policy. The fact that social 
democratic government would decide not to empha- 
size their pro-insider strategies regarding employment 
protection (especially in times of high unemployment) 
is not necessarily surprising. 

Tests for robustness in the regression using the 
OECD’s overall employment protection index are 
complicated, due to the small number of observations 
in the sample. In Table 3, it is assumed that the observa- 
tions for each country in the 1990s are independent of 


76 On the assumption that the effects of a one-unit change in a par- 
ticular variable persist, the long-term effects of such a change can 
be computed by drviding the value of the coefficient for the variable 
of interest by one minus the coefficient for the lagged dependent 
variable. 

77 Government partisanship, however, becomes insignificant in the 
instrumental variable mode] 


those in the 1980s. To explore this issue, I test whether 
the results hold in a cross-sectional design (with only 
16 observations). I regress employment protection in 
the 1990s on each country’s cumulative measure of 
government partisanship from 1964 to 1995. Although 
the significance level of government partisanship di- 
minishes, the results in Table 2 are confirmed in this 
analysis. Social democratic government is still found 
to be associated with more insider protection (in spite 
of the small number of observations, the variable is 
significant at better than the 90% level).” 

As was the case with the labor market policy models, 
the analyses presented in Tables 3 and 4 do not address 
whether the influence of government partisanship is 
subject to a lag. To explore this issue, I follow the 
strategy described before. In this case, however, the ob- 
jective is different. In the labor policy analysis, the ex- 
ploration of different lags was meant to check whether 
the statistical insignificance found was the result of not 
giving government partisanship enough time to affect 
policy. In the employment protection case we know 
that government partisanship has a significant imme- 
diate effect but we want to see whether there is also a 
long-term one. J run the regressions in Table 3 substi- 
tuting yearly cabinet partisanship for 2-, 5-, and 10-year 
averages. Government partisanship was found to be a 
significant determinant of employment protection with 
the 2-, and 5-year lags, but an insignificant one when 
the lag was longer. These results suggest that the effects 
of government partisanship on employment protection 
are strongest in the short term. 


CONCLUSION 


Going back to the initial questions that motivated 
this article’s analysis, the previous pages have demon- 
strated that insider—outsider politics are fundamental 
to a fuller explanation of government partisanship, 
policy-making, and social democracy since the 1970s. It 
is also clear that recent social democratic governments 
have not promoted some of the policies we would ex- 
pect. The strategies prevalent in the golden age of social 
democracy have been abandoned and the provision of 
equality and security to the most vulnerable sectors of 
the labor market has been sacrificed to satisfy other 
objectives. My evidence suggests that, in the presence 
of insider—outsider conflict, there is a strong tempta- 
tion for social democratic governments to implement 
inegalitarian policies. 

Herbert Kitschelt (1999) has accurately pointed out 
that the study of party strategies requires a “bridge 
across the familiar divide between students of com- 
parative political economy and parties and elections” 
(318). The model I have presented attempts to do this 
by combining research on political economy, institu- 
tions, and political behavior. Political economists often 
assume that the preferences of parties are exogenous 
and stable and that they are mediated by institutional 
configurations that produce differentiated political and 


Z Results available from the author 
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economic outcomes. The analysis in this article, how- 
ever, focuses on the transformation in party strategies 
that results from new voter demands. I show that eco- 
nomic factors affect the preferences of particular elec- 
torates and, as a consequence, the strategies of partisan 
governments. 

My analysis also provides a useful framework with 
which to analyze more recent partisanship debates. 
There is an ongoing discussion, in both academic and 
more general circles, about whether a “third way” has 
emerged as a comprehensive philosophy uniting so- 
cial democratic parties around a coherent set of policy 
alternatives. According to some analysts, a third way 
does indeed exist and it is distinguished by an emphasis 
on employment promotion as a goal and by the use of 
active labor market policies (see, e.g., Giddens 1998). 
The arguments presented in this article emphasize the 
importance of taking into consideration insider- 
outsider distinctions to understand these policy devel- 
opments. Taking this article’s framework as our guide, 
we would speculate that a number of factors could 
make pro-outsider policies more attractive to social 
democratic governments. Among them an increase in 
the number of outsiders past a possible critical point, 
a weakening of unions, or even a decrease in the 
level of “insiderness” come to mind. In fact an argu- 
ment could be made that in the United Kingdom, it 
was the influence of these last two factors that facil- 
itated the emergence of Blair’s third way. The mod- 
eration of unions in the late 1990s and the influence 
of Thatcherism are surely factors influencing the poli- 
cies of New Labour. Thatcher, Giddens (1998) argues, 
“attacked established institutions and elites.... The 
Labour Party and its intellectual sympathizers first of 
all responded largely by reaffirming old left views. The 
electoral setbacks the party suffered by so doing, how- 
ever, necessarily stimulated a new orientation” (ix). 

This article’s arguments shed some light on two addi- 
tional partisan issues. First, a number of scholars have 
explored the reasons behind the decline in partisan- 
ship observed at the individual level in recent years 
(Dalton 2002). My analysis suggests that insider- 
outsider differences help explain why some people 
do not feel represented by mainstream political op- 
tions. It is reasonable to assume that outsiders would 
be increasingly less likely to identify with parties that 
do not defend their interests. At the aggregate level, 
analysts have also perceived a progressive decline in 
the political prominence of social democracy (see, e.g., 
Pontusson 1995). My results suggest that this view may 
be inaccurate and that partisanship is still a powerful 
determinant of policy. We cannot observe these effects, 
however, if we do not look at the right policies. I argue 
that only by understanding insider—outsider differences 
will we perceive the true influence of partisanship on 
policy-making. 

The second partisan issue relates to the costs of social 
democratic strategies. The model presented in this ar- 
ticle opens the door to a debate about the implications 
of emphasizing the interests of insiders. Given some 
recent electoral setbacks in industrialized democracies, 
it is uncertain insider strategies are beneficial to social 
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democratic parties. Perhaps more importantly, there 
is also the question of what outsiders can do when 
ignored by social democratic governments. Starting in 
the 1970s, most Western democracies have experienced 
the emergence of powerful antisystem parties (espe- 
cially extreme right ones). This article’s arguments sug- 
gest that the losers in the labor market arena (outsiders 
who are not the focus of traditional left or right par- 
ties) may have reasons to turn away from mainstream 
options. 

Before closing, I would like to briefly explore three 
topics that may not have received enough attention in 
my analysis: the degree of security enjoyed by insiders, 
the perceived effectiveness of labor market policies, 
and the existence of other policy options. The impor- 
tance of the first issue cannot be understated. This 
article’s partisanship model, after all, is based on a 
fundamental difference in the way insiders and out- 
siders are vulnerable to unemployment. To the extent 
that insiders are protected from unemployment, their 
interests will be significantly different from those of 
outsiders. Factors that increase insiders’ vulnerability 
to unemployment, however, will align their interests 
with those of outsiders. In Rueda (forthcoming) I ex- 
plore this question in some detail. Using a case study 
and an analysis of surveys and macrodata, I show that 
the unemployment vulnerability of insiders is indeed 
a significant factor affecting the likelihood that social 
democratic governments will promote labor market 
policies. A hierarchical analysis of individual prefer- 
ences shows that insiders are significantly more likely 
than outsiders or members of the upscale groups to 
have high preferences for employment protection (as 
shown here in a Jess systematic manner). Insiders, how- 
ever, experience a drastic increase in their labor mar- 
ket policy preferences as soon as they feel vulnerable 
to unemployment. As implied by the insider—outsider 
model, the analysis of aggregate data also shows that 
social democratic governments are more likely to pro- 
mote higher levels of labor market policies only when 
insiders become more vulnerable to unemployment. 

As for the second topic, the economics literature 
provides us with some evidence that higher levels 
of ALMP promote lower levels of unemployment 
(OECD 1994; Jackman, Pissarides, and Savouri 1990). 
The existence of scholarly analyses demonstrating the 
beneficial effects of ALMPs, however, is not as rel- 
evant as the general acceptance by policy-makers of 
ALMP as a tool against unemployment. This accep- 
tance has been widespread in recent years. Transfer- 
ring public resources into active labor market policies 
has been an objective repeatedly endorsed by third 
way social democrats and OECD labor ministers. As 
Martin (1998) points out, it has also become part of 
the EU’s official strategy to decrease unemployment 
since the Essen Summit in December 1994 (12). The 
degree to which the perceived effectiveness of ALMPs 
has influenced social democratic strategies should be 
the focus of further research. It is in any case clear 
that social democratic governments do promote labor 
market policies when insiders are threatened by unem- 
ployment (see Rueda, forthcoming). 
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Regarding the existence of other policy options, a 
commitment to full employment (a traditional Key- 
nesian macroeconomic strategy for social democrats) 
surely poses less of a conflict between insider and out- 
sider preferences. There is, however, an important liter- 
ature showing the difficulties social democratic govern- 
ments face when trying to develop Keynesian policies 
after the early 1970s. Up to that point, social demo- 
cratic efforts to reduce the inequality and insecurity 
of the most vulnerable sectors of the labor market 
while more generally promoting growth and employ- 
ment had been very successful. The challenges posed 
by rational expectations (Alesina 1989) and increasing 
levels of internationalization (Alt 1985) are often iden- 
tified as the reasons for the end of the golden age of so- 
cial democracy. Even accepting the relevance of these 
challenges, some options are still open to social demo- 
cratic governments. Labor market policies (especially 
active ones) can be used by partisan governments to 
promote employment, growth, and equality in an envi- 
ronment that impedes demand management. I want to 
conclude this article by pointing out that it is precisely 
because of the importance of these policies that an 
accurate understanding of insider—outsider differences 
becomes crucial. My analysis emphasizes some of the 
considerable difficulties confronting social democratic 
policy-makers who are interested in equality. The ac- 
knowledgment of these difficulties may be the first step 
in finding truly solidaristic solutions. 


REFERENCES 


Addison, John T., and Jean-Luc Grosso. 1996. “Job Security Provi- 
sions and Employment: Revised Estimates” Industrial Relations 
35 (October): 585—603 

Alesina, Alberto. 1989. “Politics and Business Cycles in Industrial- 
ized Democracies.” Economic Policy 8 (Apnil). 55-98 

Alt, James 1985. “Political Parties, World Demand, and Unemploy- 
ment.” Amencan Political Science Review 79 (December): 1016- 
40. 

Alvarez, Michael, Geoffrey Garrett, and Peter Lange. 1991 “Gov- 
ernment Partsanship, Labor Organization and Macro-Economic 
Performance” American Polttical Science Review 85 (June). 539- 
56. 

Armingeon, Klaus, Michelle Beyeler, and Sarah Menegale. 2002 
Comparative Political Data Set 1960-2001 Institute of Political 
Science, University of Berne. 

Beck, Nathaniel, and Jonathan Katz. 1995 “What to Do (and Not 
to Do) with Trme-Series Cross-Section Data ” American Political 
Science Review 89 (September): 634-47. 

Beck, Nathaniel, and Jonathan Katz. 1996. “Nuisance vs. Substance ” 
Political Analysis 6: 1-36 

Bentolila, Samuel, and Giuseppe Bertola. 1990. “Firing Costs and 
Labour Demand: How Bad Is Eurosclerosis?” Review of Eco- 
nomic Studies 57 (July): 381-402. 

Berger, Suzanne and Michael Piore. 1980. Duahsm and Discon- 
tinuity in Industrial Socteties New York: Cambridge University 
Press. 


Blanchard, O J., et aL 1986. “Employment and Growth ın Europe” 
In Restormg Europe's Prospenty, ed. O. J. Blanchard. Cambridge, 
MA. MIT Press 

Blanchard, O. and L. Summers. 1986. “Hysteresis and the European 
Unemployment Problem.” In S. Fisher (ed.), NBER Macroeco- 
nomic Annual Cambridge: MIT Press. 

Boeri, Tito, Axel Bdrsch-Supan, and Guido Tabellin1 2001. “Would 
You Like to Shnnk the Welfare State? A Survey of European 
Gtzens” Economic Policy 32 (April): 7-50. 


Boix, Carles. 1998. Political Parties, Growth and Equality. New York: 
Cambridge University Press. 

Calmfors, Lars. 1994 “Actrve Labour Market Policy and Unemploy- 
ment: A Framework for the Analysis of Crucial Design Features.” 
OECD Working Papers Paris: OECD. 

Calmfors, Lars. 1993. “Lessons from the Macroeconomic Expenence 
of Sweden,” European Journal of Political Economy, 9 (March): 
25-72. 

Calmfors, Lars and Harald Lang. 1995 “Macroeconomic Effects 
of Active Labour Market Programmes in a Union Wage-Setting 
Model,” The Economic Journal 105 (May): 601-18. 

Cusack, Thomas. 1997. “Partisan Politics and Public Finance.” Public 
Choice 91 (March): 375-395. 

Dalton, Russell. 2002 Citizen Politics. New York: Chatham House. 

Doeringer, Peter B., and Michael Piore. 1971. Internal Labor Markets 
and Manpower Analysis. Lexington, MA‘ Heath 

Dore, Ronald. 1994. “Incomes Policy: Why Now?” In The Return to 
Incomes Policy, ed Ronald Dore, Robert Boyer, and Zoé Mars. 
London’ Pinter. 

Europa Yearbook Various years. London: 

Franrese, Robert. 1998 “The Political Economy of Public Debt” 
Paper presented at the Wallis Conference on Political Economy. 
Northwestern University, November 1998. 

Gabel, Matthew, and John Huber. 2000 “Putting Parties in Their 
Place.” American Journal of Political Science 44 (January). 94-103. 

Garrett, Geoffrey. 1998 Partisan Pohucs in the Global Economy. 
New York Cambridge University Press 

Garrett, Geoffrey, and Peter Lange. 1991. “Political Responses to 
Interdependence: What’s ‘Left’ for the Left?” International Orga- 
nization 45 (Autumn): 539-64. 

Giddens, Anthony 1998. The Third Way. London. Polity Press 

Hibbs, Douglas. 1977. “Political Parties and Macroeconomic 
Theory.” American Political Science Review 71 (December). 
1467-87. 

Hicks, Alexander, and Duane Swank. 1992 “Politica, Institutions 
and Welfare Spending in Industrialrzed Democracies, 1960-82.” 
American Political Saence Review 86 (September): 658-74. 

Hmao, Cheng 1986. Analysis of Panel Data. New York. Cambridge 
University Press. 

Huber, Evelyne, and John Stephens 2001. Development and Cnsts 
of the Welfare State. Chicago: University of Chicago Press. 

Iversen, Torben 1999 Contested Economic Institutions New York’ 
Cambridge University Press. 

Iversen, Torben. 1996. “Power, Flexibility and the Breakdown of 
Centralized Wage Bargaining.” Comparative Politics 28 (July) 
399-436 

Iversen, Torben, and David Soskice. 2001. “An Asset Theory of Social 
Preferences” Amencan Political Science Review 95 (December): 
875-93. 

Jackman, Richard, Christopher Pissandes, and Savvas Savoun. 
1990. “Unemployment Policies.” Economuc Policy 5 (October): 
449-90 

Janoski, Thomas 1990. The Political Economy of Unemployment. 
Berkeley: Unrversity of California Press 

Janoski, Thomas. 1994. “Direct State Intervention in the Labor Mar- 
ket.” In The Comparative Political Economy of the Welfare State, 
ed Thomas Janoski and Alexander Hicks New York: Cambridge 
University Preas. 

King, Desmond. 1995. Actrvely Seeking Work? Chicago: The Univer- 
sity of Chicago Press 

Kitschelt, Herbert. 1994. The Transformation of European Social 
Democracy. New York: Cambndge University Press 

Kitschelt, Herbert 1999. “European Socal Democracy Between Po- 
litical Economy and Electoral Competition.” In Continurty and 

in Contemporary Capitalism, ed. Herbert Kitschelt, Peter 
Lange, Gary Marks, and John Stephens. New York. Cambridge 
University Press. 

Lazear, Edward 1990. “Job Security Provisions and Employment.” 

Quarterly Journal of Economics 105 (August): 699-726 

j Assar, and Dennis Snower. 1988. The Insider-Outnder 
Theory of Employment and Unemployment. Cambridge, Mass: 
MIT Press 


Martin, John 1998. “What Works among Active Labour Market 
Polcies” OECD Labour Market and Social Policy Occasional 
Papers, No. 35. Paris: OECD. 


Insider—Outsider Politics in Industrialized Democracies 


February 2005 





McDonald, Michael and Silvia Mendes. 2001. Partes m Parlia- 
ments and Governments, 1950-1995 Political Science Department, 
Binghamton University—SUNY. 

Moene, Karl Ove, and Michael Wallerstein 2003. “Earnings In- 
equality and Welfare Spending.” World Poistics 55 (July): 485- 
516 

Nickell, Stephen. 1981 “Biases in Dynamic Models with Ford Ef- 
fects,” Econometrica 49 (November): 1417-26. 

OECD 1994. The OECD Jobs Study Paris: OECD 

OECD. 1999. Employment Outlook. Paris: OECD. 

Pontusson, Jonas. 1995 “Explainmg the Decline of European Social 
Democracy” World Politics 47 (July): 495-533. 

Powell, Bingham 1982 Contemporary Democracies. Cambridge, 
MA: Harvard University Presa. 

Powell, Bingham. 2000 Elections as Instruments of Democracy. New 
Haven, CT: Yale University Press. 

Rueda, David. 2001. “Government Partisanship and Economic 
Policy. Insider-Outsider Politics in Industrialized Democracies” 
Ph.D. Diss. Comel University. 

Rueda, David. N.d. “Social Democracy and Actrve Labor Market 
Policies’ Insiders, Gutaiders, and the Politics of Employment Pro- 
motion.” British Journal of Politcal Science. Forthcoming. 


74 


Rueda, David, and Jonas Pontusson. 2000. “Wage Inequality and 
Varieties of Capitalism.” World Politics 52 (April): 350-83. 

Saint-Paul, Gilles 1998 “A Framework for Analysing the Political 
Support for Actrve Labor Market Policy.” Journal of Public Eco- 
nomics 67 ). 151-65. 

Saint-Paul, Gilles. 1996. “Exploring the Political Economy of Labour 
Market Institutions.” Econornic Policy, 23 (October). 264-315 

Scharpf, Fritz. 1991. Criss and Choice m European Social Democ- 
racy. Ithaca, NY: Cornell University Press 

Schwartz, Herman. 1994 “Small States in Big Trouble.” World Poh- 
tics 46 (July): 527-55 

Swank, Duane, and Cathie Jo Martin 2001. “Employers and the 
Welfare State.” Comparative Political Studies 34 (October): 889- 
923. 

Visser, Jelle. 1996. “Unionization Trends Revisited.” Centre for 
Research of European Societies and Industrial Relations, 
Amsterdam. 

Wilensky, Harold 1975. The Welfare State and Equality. Berkeley. 
University of California Press. 

Woldendorp, Jaap, Hans Keman, and In Budge. 2000. Party Govern- 
ment in 48 Democracies (1945-1998). Boston. Kluwer Academic 
Publishers. 


American Political Science Review 


Vol. 99, No.1 February 2005 


Racial Orders in American Political Development 


DESMOND S. KING University of Oxford 


ROGERS M. SMITH University of Pennsylvania 


merican political science has long struggled to deal adequately with issues of race. Many studies 
inaccurately treat their topics as unrelated to race. Many studies of racial issues lack clear 
theoretical accounts of the relationships of race and politics. Drawing on arguments in the 
American political development literature, this essay argues for analyzing race, and American politics 


more broadly, in terms of two evolving, competing “racial institutional orders”: a “white supremacist” 
order and an “egalitarian transformative” order. This conceptual framework can synthesize and unify 


many 


about race and politics that political scientists have advanced, and it can also serve to 


highlight the role of race in political developments that leading scholars have analyzed without attention 
to race. The argument here suggests that no analysts of American politics is likely to be adequate unless 
the impact of these racial orders ıs explicitly considered or their disregard explained. 


hether race is the “American Dilemma,” 
WW aa inequities have been and remain con- 

founding features of U.S, experience. Has 
racial injustice been a great aberration within a fun- 
damentally democratic, rights-respecting regime? Has 
the United States instead been an intrinsically racist 
society? Has racial discrimination been the spawn of 
psychological or cultural pathologies, or a tool of class 
exploitation, or a political “card” to be played in power 
games, or something else? 

One might expect political science in the United 
States to be the center of debates, if not answers, 
on such questions. But American political scientists 
have historically not been much more successful than 
America itself in addressing racial issues. We seek to 
do so by connecting theoretical frameworks emerg- 
ing in the subfield of American political development, 
includiog King (1995), Lieberman (2002), Orren and 
Skowronek (1994, 1996, 1999, 2002), and Smith (1997), 
with insights from scholars of race in other areas of 
political science and other disciplines (e.g., Dawson 
and Cohen 2002, Omi and Winant 1994, and Wacquant 
2002). We argue that American politics has histori- 
cally been constituted in part by two evolving but 
linked “racial institutional orders”: a set of “white 
supremacist” orders and a competing set of “transfor- 
mative egalitarian” orders. Each of these orders has 
had distinct phases, and someday the United States 
may transcend them entirely—though that prospect is 
not in sight. 

This “racial orders” thesis rejects claims that racial 
injustices are aberrations in America, for it elaborates 
how the nation has been pervasively constituted by 
systems of racial hierarchy since its inception. Yet more 
than many approaches, it also captures how those injus- 
tices have been contested by those they have injured 
and by other political institutions and actors. It does 
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not deny that the nation’s “white supremacist” racial 
orders have often served vicious economic exploitation 
or that their persistence reveals psychological and cul- 
tural pathologies. Instead it provides a framework to 
organize empirical evidence of the extent and manner 
in which structures of racial inequalities have been in- 
terwoven with economic as well as gender and religious 
hierarchies and social institutions. 

But more than many scholars, our approach ana- 
lyzes the “political economy” of American racial sys- 
tems by stressing the “political,” not the “economy.” 
We see all political institutional orders as coalitions 
of state institutions and other political actors and or- 
ganizations that seek to secure and exercise governing 
power in demographically, economically, and ideologi- 
cally structured contexts that define the range of oppor- 
tunities open to political actors. “Institutional orders” 
are thus more diversely constituted and loosely bound 
than state agencies; but they are also more institution- 
alized, authoritatively empowered, and enduring than 
many political movements. Racial institutional orders 
are ones in which political actors have adopted (and 
often adapted) racial concepts, commitments, and aims 
in order to help bind together their coalitions and struc- 
ture governing institutions that express and serve the 
interests of their architects. As in any coalition, the 
members of a racial order support it out of varied mo- 
tives. Economic aims are central for many, but others 
seek political power for its own sake, or to quiet social 
anxieties, or to further ideological goals. Leaders hold 
them together by gaining broad agreement on the de- 
sirability of certain publicly authorized arrangements 
that predictably distribute power, status, and resources 
along what are seen as racial lines. Hence these al- 
liances necessarily combine what scholars have often 
treated as distinct “ideational” and “institutional” or- 
ders (cf., e.g., Lieberman 2002, Orren and Skowronek 
2004, and Smith 1997). And though the racial institu- 
tions they create at least seem to serve many members’ 
economic interests, their coalitional nature means that 
their unifying aim must be power for many p 
not just profits (cÊ Goldfield 1997, 30-31, 91). 

By presenting racial orders as political coalitions, 
we build on Omi and Winant’s (1994) depiction 
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of “racial formation” as a product of many elite- 
led “racial projects” (53-76). But in their account, 
political actors or “intellectuals” attacking or defend- 
ing the dominant racial ideology drive racial transfor- 
mations (86). Like many other scholars of American 
political development, we treat political entrepreneurs 
and the preexisting institutional orders in which they 
operate as the key independent variables shaping all 
political change, including racial development. We also 
disagree that, despite some forces working at cross- 
purposes, the American state has preserved “an overall 
unity” as a “racial state,” granting “no political legiti- 
macy” to “oppositional racial ideologies” or “compet- 
ing racially defined political projects” (80, 84). Instead, 
we see the American state as comprised of multiple 
institutional orders, including competing racial orders 
with conflicting ideologies. Though the rival orders 
have always had unequal power, to understand change 
we must recognize both that competing racial orders 
have long existed and that all have included some gov- 
erning institutions. No American racial “project” has 
gone far without aid from some such institutions, and 
no racial conflict can be grasped without seeing how 
these institutions have shaped the sincere aims of the 
actors involved and their strategic calculations. Rather 
than seeing racial change as many sociologists do, as 
“the product of the interaction of racially based social 
movements and the state” (Omi and Winant 1994, 88; 
Wacquant 2002, 52), we see it as the product of the 
interaction of opposing racial orders, as well other po- 
litical orders, all of which include some state institutions 
and some nonstate political actors and organizations. 

The balance of power in those interactions has 
shifted over time in part because, like most politically 
constructed coalitions, America’s racial orders have 
been complex and breakable. Most political actors pos- 
sess partly conflicting identities and interests, and there 
are always many goals they might like to pursue. But 
because preexisting contexts define the problems and 
options actors face, politics usually involves choosing 
sides among two or three major approaches to what 
are widely seen as the dominant issues of the day, 
even if the prevailing approaches and issues do not 
express one’s concerns fully. To accomplish much at all, 
American political actors have generally felt compelled 
to join either their current form of white supremacist 
order or its more egalitarian opponent. This means, 
however, that the competing racial orders have always 
included some members whose alignment was tenta- 
tive and alterable, while others in each era have at least 
sought to remain unaligned or to forge a third direc- 
tion. Because of the limits of politics, the latter choices 
have usually meant effectively aiding one order more 
than the other, or becoming politically unimportant, 
until exceptional circumstances have opened up new 
coalitional options and policy directions. 

In the antebellum era, for example, many supported 
institutions of white supremacy as buttresses to African 
American chattel slavery and the acquisition of Na- 
tive American lands. Others simply wanted institu- 
tional protections against aggrieved nonwhites, or a so- 
cially recognized superior status, while some displayed 
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psychological aversions, even genocidal impulses, to- 
ward people “of color.” Though most of these white 
supremacists sided with slavery when it was the is- 
sue of the day, some did so reluctantly, and others 
opposed it, temporarily allying with advocates of an 
egalitarianism they did not share. Given these internal 
tensions and changing demographic contexts, in order 
to sustain a coalition powerful enough to control key 
governing institutions, antebellum white supremacists 
sometimes had to modify prevailing legal definitions of 
“whiteness,” “blackness,” and other racial categories. 
They slowly concluded that they had to label all with 
any African ancestry “black” and accept the Irish and 
many other immigrants as “white” (Jacobson 1998; 
Williamson 1984, 17-21). Yet they remained largely 
unified around the goal of maintaining the U.S. as a 
“white man’s nation.” 

The internal tensions among those championing ega- 
litarian changes over the content of egalitarian goals 
and the means for pursuing them have been greater 
still. American discourses and institutions promising 
equal rights burgeoned in opposition to British aris- 
tocracy. Initially few British colonists thought them 
inconsistent with African slavery. But from the start, 
many black and some white Americans did; and some 
who opposed slavery favored full racial equality. Yet 
they worked in alliance with many more who were 
antislavery advocates of less extreme forms of white 
supremacy, such as “tutelary” status or colonization 
for nonwhites. And throughout history, many who have 
rejected all versions of white supremacy still have dif- 
fered on whether priority should be given to seek- 
ing economic equity, equal political status, or cultural 
recognition. Hence even when they were allied on is- 
sues such as ending slavery or segregation, advocates 
of racial change have disagreed over whether their ul- 
timate goal should be full integration or some form of 
more egalitarian racial pluralism. Over time there have 
been major shifts in the degree and kinds of egalitari- 
anism that have predominated among reformist insti- 
tutions and actors, defining the phases of the nation’s 
transformative egalitarian racial orders. 

Changes have occurred in part because individuals 
positioned on the margins of racial orders, in relation 
either to the aims or to the power structures of those 
orders, have sometimes switched their dominant alle- 
giances at critical junctures. Such was true of Andrew 
Johnson, who was ardently antislavery but otherwise 
did not favor altering systems of white supremacy; and 
Harry Truman, who had never been a strong racial 
egalitarian but who concluded for domestic, inter- 
national, and personal reasons that it was wiser to ally 
with antisepregationist northern Democrats than white 
supremacist southerners (Klinkner with Smith 1999, 
71-79, 206-224). Some important political actors have 
even operated in two conflicting racial orders at once, 
though in different ways, as in the cases of Booker T. 
Washington and Franklin D. Roosevelt. Washington’s 
Atlanta Exposition speech helped legitimate the rise 
of Jim Crow laws, but his power was always marginal 
within the Jim Crow order, and he secretly funded law- 
suits to oppose it (Harlan 1983, 32-33). To sustain his 
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New Deal coalition and his own centrality, Roosevelt 
let many New Deal programs be structured in ways that 
placated and even strengthened white supremacists. 
But he did not share their aims; so he sought to extend 
economic relief to black Americans, to support their 
legal causes, and to appoint them to higher offices when 
he could (McMahon 2003, 97-202). 

Despite these complexities, in particular settings it is 
not hard to discern what were commonly seen as the 
main proposals on the nation’s agenda that promised 
to increase or decrease racial equality of conditions 
in the near term. Scholars can recognize that issues 
such as slavery, Jim Crow segregation, and racially 
targeted aid programs have at different times been 
the central disputes around which political battle lines 
have formed. Hence in each era scholars can identify 
empirically the main institutions and actors allied to 
sustain the then-dominant forms of white supremacy, 
thereby comprising that period’s “white supremacist” 
order, and the leading institutions and actors working 
for more egalitarian racial conditions, its “transforma- 
tive egalitarian” order. The existence and analytical 
utility of these racial orders are not discredited by the 
presence of internal tensions in the orders, including 
marginal and “dual” members who may change sides, 
by some who seek to stay unaligned, or by the fact that 
the orders modify their goals and members over time. 
Rather, these features add to their explanatory force. 
The processes of change wrought by the problems 
leaders face in sustaining these orders amidst inter- 
nal tensions, by the conflicts of the orders with each 
other, by the defection of actors and institutions from 
one order to its rival, and by their interactions with 
other actors and institutions comprising American life, 
all have been engines of significant political develop- 
ment. 

The “racial orders” approach is a theoretical frame- 
work that can enable empirical studies of racial sys- 
tems to falsify hypotheses. If a racial order works 
against the economic interests of many participants, 
as antebellum laws banning free blacks in Old North- 
west states arguably did for many employers and even 
white workers, as Jim Crow laws clearly did for trans- 
portation companies, and as race-based immigration 
Testrictions probably did for many wealthy support- 
ers, it is hard to claim their economic aims drove 
that order. And if the systems of economic and po- 
litical inequality sanctioned by a racial order come to 
be greatly modified, as in the shattering of the inter- 
weaving of white supremacy and slavery before the 
rise of de jure segregation systems, it is implausible 
to deny that the order has undergone true develop- 
ment. Thus this approach can also help scholars map 
the stages and extent of the nation’s real but incom- 
plete progress toward racial equity and the political 
contests through which progress has come. And insofar 
as our framework can unify and strengthen empirical 
findings on racial developments, it can also vindicate 
the claim that these contests have been fundamentally 
political. 

To sketch the argument developed here: at the 
nation’s founding, a political coalition of Americans 
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formed that gained sufficient power to direct most 
governing institutions, and also economic, legal, educa- 
tional, residential, and social institutions, in ways that 
established a hierarchical order of white supremacy, 
though never without variations, inconsistencies, and 
resistance. That order was so bound up with the insti- 
tutional order of chattel slavery, which it legitimated, 
as to seem identical to it—but never wholly so. White 
supremacist structures often subordinated putatively 
free Native Americans and many “nonwhite” immi- 
grants as well as African Americans. And again, this 
order also always included some white racists, whose 
numbers grew over time, who opposed slavery, seeing 
it as the source of a dangerous black presence. Their 
hope was to “get shut of the Negro,” not to be his 
master. 

Yet at least from the time the Declaration claimed 
that all men are created equal, the nation also dis- 
played an incipient “transformative egalitarian” order, 
one that some soon applied to racial hierarchies. This 
order had its governmental institutionalization in legal 
guarantees of equal rights that were sometimes im- 
plemented in judicial rulings and legislative statutes, 
often under the pressure of religious groups, black 
and white. Initially, to be sure, this political order was 
far weaker than white supremacist actors and insti- 
tutions. Conflicts between proslavery and antislavery 
forces, not white supremacists and racial egalitarians, 
formed the central axis of, especially, late antebellum 
politics. But in context, the antislavery alliance was un- 
deniably a force for egalitarian racial transformation; 
and though some of its partisans were strongly racist, 
and most were only moderately egalitarian, some op- 
posed white supremacy in Congress, on state courts, 
and in state legislatures. The triumph of the antislavery 
forces greatly strengthened the position of these more 
racially egalitarian actors and the institutions they 
occupied. 

After the Civil War, this revised transformative al- 
hance built new constitutional, administrative, politi- 
cal, economic, educational, and social institutions to 
promote greater racial equality. Most important were 
the Fourteenth and Fifteenth Amendments, along with 
new schools, political organizations, and civic associ- 
ations for blacks. But the allies and institutions that 
made up the “white supremacist” racial order were 
far from eradicated. They eventually regained domi- 
nance in the modified form of the Jim Crow system 
of segregation and disfranchisement that largely pre- 
vailed until the civil rights era of the 1960s. Substantial 
further development came only when, in light of new 
national and international political contexts, some who 
had long been complicit in white supremacy, especially 
northern Democrats, chose from mixed tactical and 
moral motives to join heightening black resistance to 
those arrangements. Gaining executive, judicial, and 
finally legislative support, proponents of the nation’s 
resurgent “transformative egalitarian” racial order suc- 
ceeded in discrediting explicitly “white supremacist” 
policies definitively and expanding their own institu- 
tional order more than ever. But despite these tri- 
umphs, still-valorized “antitransformative” institutions 
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of racial inequality have continued to play major roles 
in American life, while leaders of “transformative egal- 
itarian” institutions have struggled more than ever to 
decide what further changes should be pursued. 

Useful as this framework is for making sense of racial 
development, our main claim here is that a “racial in- 
stitutional orders” approach helps explain many fea- 
tures of American politics that may appear unrelated 
to race, such as congressional organization, bureau- 
cratic autonomy, and modern immigration priorities. 
We conclude that the internal developments, clashes, 
and broader impacts of American racial orders have 
been and remain so central that all scholars of Amer- 
ican politics ought always to consider how far “racial 
order” variables affect the phenomena they examine. 
Analysts should inquire whether the activities of insti- 
tutions and actors chiefly concerned either to protect 
or to erode white supremacist arrangements help to 
account for the behavior and changes in the nation’s 
political institutions, coalitions, and contests they study. 
Any choice not to consider racial dimensions requires 
explicit justification. 

This is so precisely because racial orders have been 
constitutively interwoven with many other highly sig- 
nificant institutional orders, including gender and class 
hierarchies. Still, we recognize that African Americans, 
Asian Americans, Latinos, and Native Americans, like 
all others, have had political concerns that are best 
captured by stressing their membership in other such 
orders, not their racial positioning (Reed 2004). We 
hope that the framework we advance here will aid the 
study of all these political orders, providing us with 
ways to identify and measure their profound intersec- 
tions with racial institutions and conflicts. We suspect 
that these intersections will show how the unusual 
prominence of racial orders in America’s development 
has also given distinctive shape to its gender and class 
systems. 

Claims of “distinctiveness” call for comparative 
analyses. A racial orders approach can aid such work. 
For example, Anthony Marx (1999) has persuasively 
contended that efforts to limit combat among whites 
explain why South Africa and the U.S. elaborated for- 
mal white supremacist institutions much more than 
Brazil did. But Marx’s framework, like those of im- 
portant recent sociological works (Bonilla Silva 2001; 
Wacquant 2002), has no explicit category for what 
we term “transformative egalitarian” orders. Focus- 
ing on those orders reveals the linked roles of gov- 
erning institutions and social movements in bringing 
change and thus specifies more fully the conflicting 
power structures that prompted the compromises Marx 
discusses. Our framework also highlights key differ- 
ences between national racial systems. The institution- 
alization of formal segregation laws in one part of 
the United States, combined with only de facto seg- 
regation elsewhere and formal national promises of 
civic equality, meant that opportunities for American 
racial egalitarians were in some respects greater than in 
South Africa. Though our discussion is narrower here, 
a racial orders approach can strengthen broad-ranging 
analyses of intersecting systems of class, gender, 
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power, and inequality in the United States and else- 
where. 


PLACING RACE IN AMERICAN POLITICAL 
DEVELOPMENT 


Work on race has exploded in political science in recent 
decades. Yet leading scholars still see a “need for a firm 
theoretical foundation on which to conduct empiri- 
cal research” into American racial politics (Dawson 
and Cohen 2002, 488-89). We reach the key concept 
of our theoretical framework, “racial institutional or- 
ders,” by combining arguments for the prominence of 
America’s racial traditions in the nation’s “multiple 
traditions” (e.g., Goldfield 1997, King 1995, and Smith 
1997) with the “multiple institutional orders” approach 
to political development advanced by Karen Orren 
and Stephen Skowronek and also explored by histo- 
rical institutionalist scholars (e.g., Lieberman 2002 and 
Pierson and Skocpol 2002). To clarify this frame- 
work, we begin with “institutions,” then “institutional 
orders,” and then “racial” institutional orders. 

We follow Orren and Skowronek (2004, 82-85) in 
defining “inshtutions” as organizations that (1) have 
broad but discernible purposes, (2) establish norms 
and rules, (3) assign roles to participants, and (4) have 
boundaries marking those inside and outside the in- 
sħtutions. A “political” institution seeks to control in- 
dividuals and institutions outside as well as within its 
bounds. A “governing” political institution has widely 
recognized mandates to compel outsiders to follow its 
authority. “States” are congeries of “governing political 
institutions,” but state institutions and actors are often 
in many ways at odds with each other, not unified. 
Hence we focus not on “the state” but on often-clashing 
“institutional orders”—orders that fail to encompass 
the entire state and that include things outside the state. 

We extend Orren and Skowronek’s framework by 
defining an “institutional order” as a coalition of gov- 
erning state institutions, nonstate political institutions, 
and political actors that is bound together by broadly 
similar senses of the goals, rules, roles, and bound- 
aries that members of each order wish to see shap- 
ing political life in certain areas. Such orders contain 
constitutional provisions, statutes, and administratively 
made or judge-made rules that formally establish and 
govern those areas; legislative bodies, executive agen- 
cies, and/or courts involved in implementing those le- 
gal doctrines; and all political parties and civil groups 
that make their positions on how the areas should be 
managed central to their activities. Such groups are 
usually drawn from the economic and social actors and 
organizations regulated and partly constituted by an 
order’s rules. 

“Racial” institutional orders seek and exercise gov- 
erning power in ways that predictably shape people’s 
statuses, resources, and opportunities by their place- 
ment in “racial” categories. The orders rarely originate 
such categories. But their proponents often modify in- 
herited racial conceptions to attract new supporters 
while retaining old ones and stigmatizing opponents. 
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At times “white” has meant most northern Europeans, 
all Europeans, most Middle Easterners, and those with 
one-eighth African or one-fourth Native American an- 
cestry (Jacobson 1998; Kennedy 1959, 48). The shifting 
ways in which actors in racial orders institutionalize 
these categories partly constitute persons’ senses of 
racial identity. 

In this approach, meaningful “development” occurs 
when one predominant order gives way to another, or 
when the prevailing order’s leading concepts of racial 
goals, rules, roles, and boundaries are substantially re- 
vised, as when most white supremacists felt compelled 
to abandon slavery or when most racial egalitarians 
came to insist on equal voting rights, not just civil rights. 
Such development can arise from struggles among 
partly dissonant groups within an institutional order, 
from the interactions of racial orders with other orders, 
such as labor and party systems, and from broad social, 
economic, and international changes, all of which may 
strengthen some participants in an order and weaken 
others, or add new players and policy challenges for the 
order to confront (Orren and Skowronek 2002, 747- 
54; Pierson and Skocpol 2002, 698-704). The “racial 
orders” framework does not postulate priority for any 
of these factors, though if scholars regularly take the 
institutions and actors comprising racial alliances into 
account, we may eventually be able to judge which 
causes have mattered most. 

While the concept of an “institutional order” builds 
on work in the American political development sub- 
field, our focus on racial orders is novel. The subfield 
grew out of the efforts of pioneering scholars including 
Bensel (1984), Skocpol (1979), and Skowronek (1982) 
to “bring the state back in,” a quest that led to the 
“claim that institutions have an independent and for- 
mative influence on politics” (Orren and Skowronek 
1986, vii). But little of that work addressed race, be- 
cause for many, the impetus to grasp the roles of 
“states” and “institutions” came from dissatisfaction 
with analyzing politics in Marxist “class” or liberal 
“group” terms. Those approaches seemed unable to ex- 
plain why America had weaker redistributive and reg- 
ulatory programs than much of Europe. This new his- 
torical institutionalist literature thus had an economy- 
centered sense of “political economy.” Most writers 
also viewed American politics as shaped by a com- 
mon though internally conflicted “liberal” political cul- 
ture (e.g., Greenstone 1986, 1-49). To take one em- 
inent example, Bensel (1984, 4) argued that because 
its “common political culture” meant that “even the 
most serious ethnic or religious disputes have never 
seriously threatened to dismember the nation,” most 
U.S. “interregional competition,” including the Civil 
War, “should be interpreted as a struggle for control 
of the national political economy.” The role of racial 
cleavages in defining sectional identities, goals, and 
conflicts went unexamined.! 


1 Some other examples of important works ın the first decade or so 
of American political development work that give little attention to 
race include Gamm 1989, Hansen 1987, Orloff 1988, Sanders 1986, 
Sklar 1988, Skocpol and Finegold 1982, Tulis 1987, and Valelty 1989. 
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This founding stress on “economic conflicts plus rel- 
atively autonomous state institutions within a liberal 
culture” still characterizes some of the best recent 
American political development scholarship. But just 
as work on race has multiplied in other precincts of po- 
litical science, so, too, have many of these scholars given 
race more attention (see, e.g., Frymer 1999; Goldfield 
1997; Katznelson 1973, 1981; Kryder 2000; Lieberman 
1998, 2002; McMahon 2003; Mink 1986; Valelly 1995). 
Yet though this valuable work has detailed the wide- 
ranging importance of race, it has only begun to pro- 
duce theoretical frameworks adequate for analyzing 
race. 

There remains a strong tendency in the Ameri- 
can political development literature, tracing to Louis 
Hartz (1955), to theorize racial issues as ultimately 
products of the antebellum “master/slave” order (e.g., 
Ericson 1999, 2000; Greenstone 1993, 95-117; Orren 
and Skowronek 2004). This tendency endures in the 
“liberal culture” premise of much early American po- 
litical development work, and for some major topics 
it is appropriate. The slave system was deeply inter- 
woven with American racial patterns, and clashes over 
the extension of slavery formed the chief cause of the 
Civil War. Yet to grasp the politics of race in the ante- 
bellum era; to explain Reconstruction; to analyze later 
racial developments; and to see the impact of racial 
contests on American development generally, “mul- 
tiple orders” analyses must encompass not only this 
“master/slave” order, but, especially, America’s racial 
orders. 

Both evidence and parsimony lead us to posit two 
potent, evolving racial systems at work: first, the set of 
“white supremacist orders” created to defend slavery 
and also the displacement of the tribes. Even the Dec- 
laration of Independence sought to justify tribal sub- 
jugation (by denouncing “merciless Indian Savages”) 
and to avoid criticism of chattel slavery (by excising 
Jefferson’s language attacking the slave trade) (Ellis 
2000, 81-119). From then on, fueled by the spread of 
slavery, by desires to justify westward expansion into 
tribal and Mexican lands as racial “manifest destiny,” 
and, at times, by anxieties about immigrants, political 
leaders extended the nation’s white supremacist order 
into spheres that eventually went well beyond master/ 
slave relationships (Horsman 1981). 

Along with the state and national laws that upheld 
the master/slave order, this white supremacist system 
was comprised by statutes at the national as well as 
the state level, in the north and south, that accorded 
even free black citizens restricted political, economic, 
and mobility rights (Berwanger 1967, 42-49, 140; Fuchs 
1990, 91-93; Litwack 1961, 31, 70-93, 113-15). We 
stress that while most of the actors and institutions 
in the antebellum white supremacist order supported 
slavery, some did not, because the latter played piv- 
otal political roles. In an 1857 Oregon constitutional 
referendum, 74% of Oregon voters rejected permit- 
ting slavery in their state, but 89% favored exclud- 
ing all blacks (Berwanger 1967, 93). Analysts have 
struggled to explain such positions as somehow due to 
white workers’ economic interests, while terming those 
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views a misguided, racist “blind spot” (Goldfield 1997, 
79-80, 92, 108-111). Though worker fears of cheap 
competitors surely played a part, so great an antiblack 
vote in a state seeking immigrant labor indicates that 
many saw all nonwhites as undesirable (Fuchs 1990, 24; 
Hutchinson 1981, 24, 35-39). A racial orders approach 
suggests they did so not only from misconceived eco- 
nomic motives but also out of beliefs in black inferi- 
ority, fear of racial strife, and desires to reserve power 
for those with whom they identified racially. On many 
racial measures these Oregonians could side with slave- 
holders, unless and until the issue became slavery and 
blacks in their own backyard. 

The core of the antebellum white supremacist or- 
der included much of the Senate and most presidents, 
products of electoral systems that overrepresented 
the slave states; most of the state officials in those 
states; and all proslavery Whigs and Jeffersonian, then 
Jacksonian, Democrats. The more tenuous members 
were white supremacist but antislavery Whigs and Free 
Soil Democrats. The order also included politically ac- 
tive church and scholarly leaders who justified slavery 
and white rule (Bay 2000; Takaki 1993, 173-76). This 
institutional order had a broadly shared goal, extend- 
ing white domination. It had rules and norms, includ- 
ing racialized slavery laws and other denials of equal 
tights, that assigned different political, economic, and 
social roles to whites and various nonwhites. And it had 
boundaries, for there were always rival institutions and 
actors. 

This white supremacist order made explicitly racial 
identities seem natural and vital to millions. It ha- 
bituated many it privileged as “white” to think of 
their racial status as a primary feature of their lives 
(Du Bois 1992, 700-01). Its imposed inequalities gave 
many white farmers and workers as well as slavehold- 
ers a sense of economic dependency on the mainte- 
nance of racial restrictions that seemed to make their 
lands, jobs, and wages more secure. It also generated 
in many a sense of racial entitlement, which most de- 
fended in religious and biological terms. It did all these 
things by creating politically powerful institutions in 
which all whites could officially share, along with public 
policies that advantaged whites in relation to blacks, 
even if some whites benefited far more than others. 
All of this made most whom governing institutions 
deemed “white” resistant to radical transformations 
in the white supremacist order, even those who wished 
to see slavery and blacks expunged from America. 

Yet U.S. history displays a rival, expanding “trans- 
formative egalitarian racial order” that built on doc- 
trines and institutions of equal rights also present from 
the nation’s inception. The American revolutionaries 
rejected monarchy and aristocracy for democratic re- 
publics, and systems of primogeniture and entail for 
greater individual powers over property. Then they cre- 
ated many new political, economic, and cultural insti- 
tutions to pursue the broad goals of “equal rights” and 
“equality before the law,” and that provided greater 
equality to European-descended men than ever before. 
Its rules and roles called for courts that upheld equal 
justice, a comparatively broad suffrage, market systems 
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promising equal rights to all participants, equal op- 
portunities for worship, and general adherence to the 
natural rights doctrines used to support these claims 
(Kesler 1998, 13-23). The fuel to ignite those doctrines 
and institutions into conflict with white supremacist 
ones came largely from “free labor” economic and po- 
litical interests, but also from the moral indignation 
that slavery fostered in virtually all blacks and some 
whites. Of the two great antebellum slavery issues, the 
most decisive conflict, opposition to the expansion of 
slavery, was predominantly driven by “free labor” com- 
mitments; but outrage against southern efforts to re- 
capture fugitive slaves came chiefly from allied racially 
egalitarian reformers (Foner 1988, 75—87, 124-26, 543- 
47; Takaki 1993, 129-30; Wiecek 1977, 153-62, 216-48). 

Through this alliance, specifically racial egalitarian 
principles won limited institutionalizations in the an- 
tebellum years, and then growing numbers of activist 
invoked them to champion black rights. These in- 
stitutions included, first, the Revolutionary-era laws 
ending slavery in the north and promoting manumis- 
sion in the south, and the congressional ban on slav- 
ery in the Northwest Territory; then, beginning with 
the 1820 Missouri controversy, congressional advocacy 
and some judicial rulings claiming Article IV “privi- 
leges and immunities” of citizenship for free blacks; 
then the statutory and procedural obstacles to the 
return of fugitive slaves erected by some northern 
legislatures and courts, prodded by more egalitarian 
white and black churches and antislavery organizations 
(Goldfield 1997, 93-104; Morris 1974, 94-120, 219-22; 
Zilversmit 1967, 135-42). Even if white abolitionists 
like William Lloyd Garrison had trouble accepting 
blacks like Frederick Douglass as equals, together their 
efforts were enough to spark repressive responses. The 
Northwest states banned entry of free blacks, and all 
states denied them full civic equality. Still, in most 
northeastern states blacks kept their political, educa- 
tional, and economic statutory rights and used them to 
work for more (Berlin 1974, 225-29, 316-33, 360-64; 
Litwack 1961, 70-75, 84-104, 153-70; McFeely 1991, 
175-77). 

When southern demands forced antislavery white 
supremacists to decide whether to side with more 
racially egalitarian actors and institutions to stop 
slavery’s spread, many did so. Exemplary here is 
Pennsylvania Democrat David Wilmot, author of the 
proviso that sought to ban slavery in the western ter- 
ritories. Wilmot feared slave competition with “free 
white labor,” thought blacks inferior, and hated the 
notion of sharing power with them. He wrote privately 
that whites should never be “ruled by men,” white or 
black, “brought up on the milk of some damn Negro 
wench” (quoted in Berwanger 1967, 125-26). But faced 
by slavery’s expansion, he became a Republican and, 
while urging peace with the south, supported President 
Lincoln as a wartime Senator before withdrawing to a 
judgeship. 

Lincoln’s Republicans, the center of the late ante- 
bellum transformative alliance, came to power cham- 
pioning only an end to slavery in the territories and 
the equal claim of all races to the limited rights of the 
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Declaration of Independence; but those views were 
enough to spur southern secession. Lincoln preferred 
war to slavery’s expansion (Greenstone 1993, 245-85). 
The war’s polarizations and the key military contri- 
butions of blacks then pushed Lincoln and many oth- 
ers farther. Believing that, “as He died to make men 
holy,” they were called to make men free, Republi- 
cans passed statutes and constitutional amendments at 
the state and national levels that banned slavery and 
elaborated a far more extensive and egalitarian racial 
order. After white southerners showed at the war’s end 
that they still sought to oppress the ex-slaves via new 
Black Codes, many angry northerners even supported 
black voting rights. Though few whites saw much eco- 
nomic benefit in paying to aid the freedmen, by the late 
1860s the electoral needs of many white Republicans, 
the interests of their black partisans, and the moral 
goals of the now-prestigious reform groups all pushed 
hard for political equality (Kousser 1974, 27-29, 241- 
42; Valelly 1995). After the fifteenth Amendment in 
1870, the expanded egalitarian order centered on new 
constitutional guarantees of formal racial equality in 
political as well as economic and civil rights. Its propo- 
nents included many congressional Republicans, new 
Reconstruction agencies like the Freedmen’s Bureau 
and the Freedmen’s Saving Bank, many Republican 
state officials including black officeholders, religious 
reform groups, and the numerous new black schools 
and civil groups they helped create (Du Bois 1992, 220- 
32, 599-600; Goldfield 1997, 115-23; W. Nelson 1988, 
123-47). 

Those developments are made more comprehensi- 
ble by adding a “racial orders” stress on power and 
ideological goals to the more traditional emphasis on 
economic interests embodied in the effort of Orren and 
Skowronek (2004, 133-43) to explain the politics of this 
period in terms of allies of former masters contesting 
with allies of former slaves. That approach has difficulty 
making sense of the pivotal actors who were antislavery 
but also white supremacist, like the great foe of Re- 
construction, President Andrew Johnson, and others 
in the new coalition he sought to form. It is hard to use 
Johnson’s economic interests to explain why he sided 
with his old opponents, the former masters, against the 
slaves he had once supported. But when seen as an ef- 
fort to build a new, nonslavery white supremacist order, 
his conduct was not foolish. Slavery was no longer on 
the agenda. Preserving white supremacy against egal- 
itarian reformers was. Most white southerners, most 
Democrats, and even many Republicans welcomed a 
chance to stop the equal rights crusaders’ assaults on 
white privilege. Johnson simply moved too crudely and 
abruptly to succeed (Foner 1988, 216-27, 247-51). 

Nonetheless, from 1867 on, many voters showed 
renewed willingness to support white supremacist 
Democrats. Even after defeating Johnson and enfran- 
chising blacks, many Republicans decided by the early 
1870s that they could advance their power and eco- 
nomic interests best by largely relinquishing support 
for African Americans, beyond maintaining black vot- 
ing rights. Yet in so doing, they permitted hostile white 
supremacist forces to strengthen; and they were also 
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on the unpopular side of the leading nonracial issue 
of the day, whether money policies would favor credit- 
seeking farmers and workers, or creditors. Amidst de- 
clining electoral fortunes, and after a last gasp effort 
to protect black voters, the Federal Elections Bill of 
1890, only helped the Democrats win in 1892, most 
Republicans defected entirely to the resurgent white 
supremacist order. Many joined the Democrats in re- 
pealing Reconstruction-era election Jaws in 1894, and 
Republican justices upheld de jure segregation in 
Plessy v. Ferguson, 163 U.S. 537 (1896) (Foner 1988, 
488-511; Gillette 1979, 18-49; Logan 1965, 43-75, 140- 
41). 

The modified white supremacist order they con- 
structed included not only antiblack Jim Crow mea- 
sures but also similar laws and practices aimed at 
Mexicans in the Southwest, Asians in California, and 
Native Americans in many western states, as well 
as, first, Chinese exclusion, then racially justified im- 
perial conquest and governance of Filipinos, Puerto 
Ricans, and various Pacific Islanders, and eventually 
race-based immigration quotas (Fuchs 1990, 80-86, 
110-22; Ngai 1999). These other forms of domina- 
tion and exclusion differed in some ways from the 
Jim Crow system and could reasonably be viewed as 
relatively autonomous racial orders. But they were 
created via alliances of western whites and northern 
imperialists and immigration restrictionists with the 
southern architects of black segregation, and much of 
their ideological content defined how far those they 
subordinated should be treated like blacks (Jacobson 
1998, 203-45; Takaki 1993, 148-49, 209-13; Williams 
1980). 

Equivocally before, but overwhelmingly after 1892 
up to the 1930s, this “second-phase” white supremacist 
order encompassed majorities in all three branches 
of the federal government, virtually all southern and 
many western officeholders, and many other local offi- 
cials and police forces. It centered on some national and 
many state and local statutes, executive policies, and 
judicial doctrines that enacted explicit forms of racial 
apartheid, race-based immigration and naturalization 
restrictions, and exclusionary housing, educational, and 
employment practices, along with titularly race-neutral 
vagrancy laws, tenant farming rules, criminal statutes, 
and voter registration and jury selection systems ad- 
ministered to maintain white supremacy (and used 
also to disempower poor whites where they failed to 
cooperate). The order was supported by new intellec- 
tual justifications for the “tutelary” subordination of 
less “civilized” and “evolved” nonwhites (Almaguer 
1994; Fuchs 1990, 96-98; Oshinsky 1996, 40-41). That 
content, and the formally neutral measures, show how 
the American systems of white supremacy had to be 
restructured because of the successes of the transfor- 
mative egalitarian order in institutionalizing enduring 
bans on overt racial subordination. But restructured 
as it was, this system of segregation, disfranchisement, 
and immigrant exclusion remained a white supremacist 
order that made largely formal, limited concessions to 
the more egalitarian institutions and actors that oppo- 
sed it. 
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Yet just as the Civil War egalitarians did not suc- 
ceed in removing all institutional bases for the resur- 
gence of the white supremacists, so their new system 
did not eradicate the postwar egalitarian racial or- 
der. The constitutional provisions and some national 
and state statutes remained available for judges will- 
ing to apply them. Often dissenters and sometimes 
even majorities did so, as when the Supreme Court 
upheld birthright citizenship for Chinese Americans 
and struck down laws that overtly banned blacks from 
juries and prevented whites from selling housing to 
blacks (U.S. v. Wong Kim Ark, 169 U.S. 649 [1898]; 
Rogers v. Alabama, 192 U.S. 226 [1904]; Buchanan 
v. Warley, 245 US. 60 [1919]). Both the Declara- 
tion and the Reconstruction amendments and statutes 
also were central to the rhetoric and the tactics of 
black protest organizations such as Ida Wells-Barnett’s 
Negro Fellowship League, William Monroe Trotter’s 
New England Suffrage League, and the NAACP, 
founded in 1909 and soon focused on winning court 
decisions and legislation that could realize the egali- 
tarian principles therein proclaimed. They had white 
allies among journalists like Oswald Garrison Villard, 
urban social workers like Jane Addams, some church 
and labor leaders, socialists, and some Republican leg- 
islators like Congressman Leonidas Dyer, who pushed 
for antilynching legislation (Nieman 1991, 122-30). But 
white supremacists dominated Congress, the executive 
branch, and most courts, so transformative egalitarians 
won little more than sporadic judicial victories in these 
years. 

In the early 1920s hundreds of thousands of alien- 
ated African Americans supported Marcus Garvey’s 
separatist Universal Negro Improvement Association, 
which rejected white supremacy, yet at times made 
common cause with segregationists (Cronon 1969, 189- 
95). If, in creating a network of black vocational 
schools, Garvey’s hero Booker T. Washington tried to 
operate in both racial orders at once, Garvey strove 
instead to create a third, black institutional order. 
Both men inspired millions of African Americans, Still, 
their public acceptance of racial separatism helped le- 
gitimate the Jim Crow system, leading W. E. B. Du 
Bois to charge that they were on balance aiding white 
supremacy (Balfour 2003; Franklin and Moss 1988, 
244-50). On a racial orders analysis, it is hard to dis- 
agree. 

But WWI and industrialization created new de- 
mands for black labor. The ensuing migration of 
blacks to northern industrial cities made them a poten- 
tially important voting bloc and generated new activist 
African American organizations, including the Na- 
tional Negro Congress, Philip Randolph’s Brotherhood 
of Sleeping Car Porters, and some interracial CIO 
unions. The Depression then brought many northern 
liberals into national offices, and the Communist Party 
launched efforts to win rights for black Americans such 
as the famous “Scottsboro Cases.” All these develop- 
ments strengthened transformative egalitarian forces 

in and Moss 1988, 339-48; Meier and Rudwick 
1976, 314-32). Many New Dealers in the Congress, 
the new executive agencies, on courts, and in northern 
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state and local governments wished to include black 
Americans in the economic restructuring they were 


ursuing. 

But aided by the seniority system, white supremacist 
representatives of the one-party Democratic south 
held many key congressional posts and were strongly 
represented in FDR’s executive branch and the courts. 
They also dominated many of the state and local gov- 
ernments that implemented most New Deal programs. 
Hence proponents of Jim Crow were able to gain exclu- 
sionary concessions from Roosevelt and racially liberal 
New Deal policymakers in the design or administra- 
tion of programs like old age insurance, unemployment 
compensation, housing aid, the Civilian Conservation 
Corps, and new labor laws. The concessions meant that 
these new institutions largely maintained the white 
supremacist order, embittering activists like Randolph, 
who concluded that whites should never provide move- 
ment leadership (Bates 2001). African Americans did 
benefit from many New Deal programs over time, 
and through some of his appointments to administra- 
tive agencies and, particularly, to the judiciary, FDR 
made the nation’s conflicting racial orders less unequal 
in power. But he shaped political development most 
with regard to the nation’s economic and federalist 
structures, not its racial ones (Katznelson, Geiger, and 
Kryder 1993; Lieberman 1998; McMahon 2003; 
Quadagno 1994). 

Change came in the post-WWII years through fur- 
ther migration of blacks to the north; a convergence 
of litigative, lobbying, and militant protest actions by 
leading black groups including the NAACP, the Urban 
League, the Southern Christian Leadership Council, 
and others; and Cold War pressures to make the na- 
tion less vulnerable to Communist criticism. In this 
context many northern Democrats in Congress, and 
first Harry Truman and later Presidents Kennedy and 
Johnson, along with the Warren Court, decided that 
both national and partisan interests could best be 
served by repudiating racial segregation and disfran- 
chisement and white supremacist ideologies and re- 
viving the Reconstruction amendments and statutes 
(Dudziak 2000). The manner in which, in response to 
domestic and international pressures, key actors and 
institutions joined the transformative egalitarian order 
seems even more central to explaining change than 
the heightened “resource mobilization” or the adop- 
tion of new concepts of “black identity” by “minority 
movements” rightly stressed by scholars using “social 
movements and the state” frameworks (e.g, McAdam 
1984 and Omi and Winant 1994, 98-99). 

White supremacists in Congress, and in many state 
and local offices, and some judges resisted ferociously, 
but these altered political contexts left them more iso- 
lated and less powerful. The resulting civil rights con- 
gressional and state laws, the executive endorsements 
they received, the expansion of civil rights enforcement 
and administrative capacities they authorized, and the 
supportive judicial rulings of the era all constituted 
genuine development in Orren and Skowronek’s terms. 
They brought the nation’s transformative egalitarian 
racial order to predominance, for it now encompassed 
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the national legislative, executive, and judicial branches 
and included many new officials and institutions, in- 
cluding the Equal Employment Opportunity Commis- 
sion, the HEW Office for Civil Rights, and the Depart- 
ment of Labor’s Office of Contract Compliance. And 
though the struggle had centered on the rights of blacks, 
egalitarian institutions and actors soon expanded their 
aims to include women, Latinos, Native Americans, 
Asian Americans, and the disabled (Skrentny 2002, 8- 
20). 

Yet though overtly white supremacist policies were 
discredited and egalitarian principles widely pro- 
claimed, many whites north and south remained pro- 
tective of arrangements that the Jim Crow system had 
generated, especially de facto residential and school 
racial segregation, various legal limits on national pow- 
ers to interfere with state and local practices, and the 
overrepresentation of whites at the top of most Amer- 
ican political and economic institutions. In addition, 
many white and some black supporters of egalitarian 
change were troubled by the disruptions and resistance 
those changes engendered. Once again, some white re- 
formers felt anxious that they might lose control of the 
extent and pace of social transformations. Many others 
felt daunted or exhausted by the costs of continuing 
the changes the civil rights era had begun, much less by 
the greater ones posed by expansions of those efforts 
to more and more groups. In ascendancy, the modern 
transformative egalitarian alliance soon become more 
uncertain of its agenda than its predecessors had ever 
been. The chief beneficiary of these discontents was 
the Republican Party. The GOP added to its traditional 
base the support of prosegregationist George Wallace 
Democrats as well as formerly Democratic “neo-cons” 
who felt the Sixties had gone too far. Conservatives 
created a coalition that, aided by Ronald Reagan’s 
charisma, eventually came to power in all three fed- 
eral branches, led by southern and western Repub- 
licans such as Newt Gingrich, Trent Lott, William 
Rehnquist, and both the elder and the younger George 
Bush (Carter 1996; O’Reilly 1995). 

One can easily find in modern conservative ranks 
many actors and institutions who once opposed de- 
segregation and modern civil rights laws, as Bush and 
Rehnquist did, and who still oppose policies promis- 
ing near-term decreases in material racial inequali- 
ties. Still, it is not correct to portray this coalition as 
merely a continuation of the old white supremacist or- 
ders. The transformative egalitarian order, once nearly 
invisible, then constitutionally enshrined but politi- 
cally dominated, is now authoritative in American 
law and many governing agencies. As a result, the 
terrain of the clashes between America’s rival racial 
institutional orders has shifted. Most institutional oc- 
cupants of what we see as the current descendant of 
“white supremacist” racial orders steadfastly and sin- 
cerely reject that label—so it is appropriate to describe 
their contemporary variant as an “antitransformative” 
racial order. As Swain (2002) documents, there remain 
overtly white supremacist forces in American life, but 
though they provide some support to the modern anti- 
transformative order, they do not define it. 
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Nonetheless, one major contribution of the “racial 
orders” framework is to compel recognition of a dis- 
turbing reality. The contemporary “antitransforma- 
tive” alliance does have a widely recognized racial 
agenda, though a negative one. Its actors and insti- 
tutions oppose measures explicitly aimed at reducing 
racial inequalities. Because many of the institutional- 
ized inequities in schooling, jobs, education, and polit- 
ical offices created by the Jim Crow order still endure, 
this opposition helps maintain many superior white 
statuses and privileges for the near term, however 
much some who espouse “color-blind” policies hope 
for greater equality in the long run. It is also likely 
that others are attracted to such policies precisely be- 
cause they may slow alterations in segregated, unequal 
arrangements. 

Because racial egalitarians cannot accept that nega- 
tive racial agenda, the context between today’s orders 
has come to center largely on clashes between transfor- 
mative egalitarian actors and institutions who support 
many kinds of direct action to reduce material racial in- 
equalities in the near term—including most Democrats, 
many administrative agencies, some federal and state 
judges, many educational institutions, some corpora- 
tions, and many liberal and minority reform groups— 
and an antitransformative order united by rejection 
of such actions, including most Republican national 
and state elected officials and partisans, many federal 
and state judges, and conservative religious, ethnic, 
corporate, and activist groups. In that contest, each 
of today’s racial orders has strengths and weaknesses. 
The widespread resistance to racial change means that 
American political actors can still make appeals to 
white interests when discussing policies like affirma- 
tive action, racial profiling, housing and school de- 
segregation, and economic aid to minority communi- 
ties. But with egalitarian precepts more deeply insti- 
tutionalized, they now must generally do so indirectly 
or covertly (Gilens 1999; Kinder and Sanders 1996; 
Mendelberg 2001). Rhetorical allegiance to egalitarian 
ideals has become de rigueur—even though that of- 
ten means reinterpreting civil rights slogans such as 
“color-blindness” and “equality of opportunity” to jus- 
tify resistance to changes in unequal racial statuses. 
Many Americans who honestly reject white supremacy 
are receptive to those reinterpretations, feeling that 
principles of equal justice and practical realities limit 
the ways in which governments can legitimately and 
effectively pursue a more racially egalitarian society 
(Sniderman and Piazza 1993). Many still wish to see 
entrenched racial disadvantages reduced; but this re- 
jection of direct efforts leaves few options for doing 
so. The main recent policy response has been unprece- 
dented levels of mass incarceration, overwhelmingly of 
nonwhites (Wacquant 2002). That has not prevented 
significant reversals in racial arenas such as school de- 
segregation, with a smaller percentage of blacks now 
attending majority white schools than at any time since 
1968, with white students the most segregated of all 
groups, and with the Supreme Court voiding most 
forms of affirmative action (Frankenberg, Lee, and 
Orfield 2003, 4, 6; Gratz v. Bollinger, 439 U.S. 249 
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[2003]). Though these are victories for antitransfor- 
mative forces, they do not generate faith that racial 
problems are fading. 

Those victories do show that while many of the newer 
egalitarian institutions and policies of the 1960s and 
early 1970s continue to work for greater racial equality, 
the problems facing modern transformative egalitarian 
forces are great. They have struggled in the last quarter- 
century to find a unifying agenda. Today’s African 
American activists pursue many different “black vi- 
sions” of civic equality and community flourishing 
(Dawson 1994, 2001). For some like Louis Farrakhan, 
the transformative order of the civil rights era has 
failed, so other, more separatist or Black Nationalist 
routes to greater prosperity must be sought. Though 
the Nation of Islam’s control of governing institu- 
tions is too lmited to form a rival racial order, 
its leaders often opt out of working with the mod- 
ern racially egalitarian coalition. Critics call those 
choices misguided, self-marginalizing acquiescence to 
antitransformative forces and urge a traditional civil 
rights agenda. But with much of that agenda for- 
mally accomplished, black leaders face heightened, 
often conflicting demands to focus more on suppressed 
concerns of class, gender, and sexual orientation, rather 
than race alone (Cohen 1999; Reed 1999, 2004). 

The new immigrant groups from Asia and Latin 
America, moreover, confront modern versions of old 
pressures to situate themselves in relation to to- 
day’s transformative egalitarian and antitransforma- 
tive racial orders. If they seem to embrace the latter, by 
seeking to be seen as “model minorities” within what 
they affirm to be a just system of equal opportunities, 
they face condemnation from many civil rights orga- 
nizations and their allies. If they seem to choose the 
former, by joining in calls for reparations, group rights, 
and multiculturalism, they risk getting locked out of 
avenues to social mobility that, for many, were what 
coming to America was all about (Smelser, Wilson, and 
Mitchell 2001; Waters 2001). If they, too, seek somehow 
to opt out of American racial politics or to build orders 
of their own, again those choices may end up helping 
existing systems of racial inequality to remain largely 
unchanged. 

And with antitransformative forces more united 
against most changes than egalitarian actors and in- 
stitutions are in pushing for any changes, both po- 
litical parties have incentives not to pursue policies 
that whites find threatening too ardently or openly 
(Frymer 1999). As a result, if antitransformative forces 
feel compelled to use the language of egalitarianism, 
transformation-minded actors and agencies often feel 
driven to craft policies in tortured ways. This is espe- 
cially so with regard to the “dividing line” issue of this 
era, race-targeted remedies. Many advocates now jus- 
tify such policies by urging the pursuit of “diversity” for 
its own sake. But there can be many kinds of diversity; 
logically if not politically, a focus on racial diversity 
must be defended as a means to get past otherwise 
intractable racial inequities. Yet the power of each ri- 
val racial order today is sufficient to cause many to fear 
either to affirm or to abjure directly goals of more equal 
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racial conditions. Thus egalitarian norms now bar the 
deployment of overtly white supremacist language, but 
when it comes to concrete policies, the modern racially 
egalitarian order often lacks the power to overcome 
its antitransformative opposition (Lieberman 2002; 
Sabbagh 2004; Schuck 2003, 160-69; Skrentny 1996, 
2002). 

We hope it is clear at this point that, just as a racial 
orders framework helps explain much about America’s 
past, many key features of what analysts have deemed 
today’s “postsegregation era” (Reed 1999) or period 
of “advanced marginalization” of African Americans 
(Cohen 1999) can be illuminated by viewing them 
as clashes between the modern, internally conflicted 
“transformative egalitarian” racial order and the resur- 
gent “antitransformative” racial order. But valuable as 
those contributions are, they may be the lesser part of 
the intellectual gains that can come from a racial orders 
approach. From the nation’s inception to the present, 
the impacts of these orders have not been confined 
to each other. The white supremacist and egalitarian 
transformative orders have also interacted with most 
of the nation’s other “multiple orders,” with far-ranging 
consequences for many aspects of American life that 
may seem distant from race. The frequent failure to 
grasp these connections may well represent the greatest 
weakness of political science in grasping the place of 
race in American politics. 


THE UNSEEN IMPACTS OF RACIAL ORDERS 


Many features of the U.S. political system, from na- 
tional powers over commerce and other economic con- 
cerns, to states’ rights and voting nghts, to structures 
of congressional representation, to immigration and 
naturalization, the scope of free expression, criminal 
justice procedures, and much more, have never devel- 
oped apart from pressures to alter or to maintain the 
nation’s racial ordering. The pattern continues today. 
The operations of the federal civil service; the organi- 
zation of Congress; the content of major pieces of social 
policy during the New Deal and the War on Poverty; 
the opposition to “big government,” “tax and spend 
programs,” and “welfare” from the Reagan adminis- 
tration on; and the activism of modern courts, among 
other matters, have all been driven in major ways by 
battles over how far racial hierarchies would be kept 
or changed. 

When political scientists ignore these impacts, or 
analyze them without a suitable theoretical frame- 
work, they often neglect or misunderstand the con- 
duct of actors who are responding to the tensions 
and opportunities generated by America’s racial or- 
ders. As a result, not only are these writings inade- 
quate in their discussions of race, but they fall short in 
their accounts of the apparently nonracial topics they 
address. 

Because it is the literature on which we build, we 
illustrate these shortcomings with some otherwise out- 
standing early and recent contributions to the Amer- 
ican political development field. No example is more 
revealing than Ira Katznelson’s justly influential works 
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on urban politics, which dealt with race more explic- 
itly than any other early subfield writings. Katznelson 
nonetheless adopted explanatory frames that made a 
politics of class, constrained by an overwhelmingly lib- 
eral political culture within a decentralized state, fun- 
damental for explaining the conditions of American 
cities. 

In City Trenches, Katznelson (1981) located the dis- 
tinctiveness of American politics in its separation of 
the politics of work, where class conflicts prevailed, 
from the politics of community, dominated by “ethnic- 
ity, race, and territoriality, rather than class” (6, 19). 
Race thus often appeared functionally equivalent to 
ethnicity and territory and distinct from class and 
workplace issues. And insofar as Katznelson saw race 
as distinctive, he treated the experiences of African 
Americans as the exception that proved the rule of 
the generally class- and culture-determined patterns 
of American politics. As Katznelson put it, “with re- 
spect to race, especially, as opposed to ethnic divisions 
among white workers, the distinctive character of the 
black experience... has created fissures in American 
society that are as deep as, and often deeper than, those 
of class. ... [It is the] one immense exception” (12). 

Despite its immensity, Katznelson (1981) went on to 
embrace Louis Hartz’s account of the liberal national 
character of America, which minimized the significance 
of racism, in order to explain why in America, class 
conflicts were generally workplace matters and were 
moderated even there by a cultural consensus on mar- 
ket institutions within a liberal polity (14-17). He gave 
little attention to workplace racial clashes. Thus in- 
stead of making race a center of analysis, Katznelson’s 
work/community dichotomy perpetuated the Hartzian 
presumption of its marginality to the main, class- and 
culture-centered line of development in the U.S. polit- 
ical economy. 

To its credit, City Trenches did place the politics of 
community, ethnicity, and race on an equal basis with 
those of workplace class divisions. Katznelson also did 
not deny that white workers often supported bans on 
hiring blacks and formed segregated or exclusionary 
unions (Kelley 1990; B. Nelson 2001). But he left largely 
unaddressed just how and why the “exception” of race 
had so often gone “deeper” than class in shaping even 
conduct at work. Our answer is that membership in 
the white supremacist order has long given workers 
a share of power, status, and ideological valorization, 
as well as apparently more secure, if often meager, 
economic rewards. The fact that white workers long 
not only accepted but insisted on racial workplace 
and union configurations strongly suggests that, rather 
than being either consequences or variations of class 
structures, politics long made racial hierarchies con- 
stitutive features of the American political economy. 
By sometimes equating race with ethnicity and most 
often stressing ethnicity as the organizing framework 
for unions, parties, and urban politics, Katznelson did 
not bring out the full character or scope of the nation’s 
white supremacist order. 

In more recent work, Katznelson has provided valu- 
able evidence of the role of that order in shaping con- 
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gressional behavior and New Deal policymaking (e.g., 
Katznelson, Geiger, and Kryder 1993). And even in 
City Trenches, Katznelson gave much more attention 
to race than some of the most justly celebrated recent 
works in American political development, studies that 
have documented the evolution of major institutions 
including the federal bureaucracy, congressional struc- 
tures, and immigration policy. Though these works af- 
ford a wealth of valuable insights, many nonetheless 
exhibit several deficiencies. 

First, they do not employ theoretical frameworks 
that ask whether the governing institutions and po- 
litical behavior they examine have been involved in 
explicitly racial conflicts. Second, partly as a result, 
their historical narratives often omit glaring ways in 
which racial orderings have shaped the institutional 
goals, structures, procedures, and political actions they 
examine, as well as the ways in which those institu- 
tional features and actions have worked to perpetuate 
or reconfigure American racial patterns. Third, many 
scholars neglect, in particular, both the internal and 
the external politics of political communities defined 
and subordinated through racial categories. Because 
African Americans, Latinos, Native Americans, Asian 
Americans, and others have been disempowered his- 
torically, scholars tend to ignore them as politically 
irrelevant, either acquiescent or ineffectual. But pre- 
cisely because racial orders have been so important in 
the United States, the politics within such communities, 
and their often masked resistance to the larger orders 
in American life, have always mattered greatly (Kelley 
1994, 35-53; Yu 2001). 


Bureaucracy 


From the post-Reconstruction years on, federal depart- 
ments such as Agriculture, Interior, Treasury, State, and 
the Post Office bestowed upon African Americans two 
sorts of experiences. First, they discriminated against 
blacks in hiring practices, routinely segregated eu 
and confined them to low-level positions. 
tion came via measures such as the “rule of three” 
whereby for every opening, the U.S. Civil Service Com- 
mission provided a list of three eligible applicants to the 
hiring department, which then picked one. In this way 
an African American candidate could be on the list 
three times without being selected. From 1914, the 
same commission attached a photograph to each name 
included in the list of three eligible, with predictable 
effect. No African Americans rose to senior office. 
Second, federal departments helped to devise, im- 
plement, and monitor the segregationist order legally 
in place between 1896 and 1954. Before the 1950s, fed- 
eral officials, especially middle-rank bureaucrats, as- 
siduously upheld “separate but equal” arrangements 
or introduced them where they had not existed. The 
Federal Bureau of Prisons, for example, from the 1920s 
to the 1950s repeatedly sought to ensure that peniten- 
tiaries guaranteed segregated accommodations, eating, 
and working arrrangements (King 1995). The War De- 
partment was also a notorious enforcer of segregation 
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during WWII (Kryder 2000). We believe, then, that 
federal bureaucracies were profoundly shaped by and 
active in constituting America’s post-Reconstruction 
racial structure, in ways that a racial orders analysis 
can bring out. 

Contrast in this regard Daniel Carpenter’s The 
Forging of Bureaucratic Autonomy (2001), a sterling 
example of American political development themes 
and methods. Carpenter deploys quantitative analy- 
ses as components in three comparative historical nar- 
ratives that provide powerful support for his central 
claim. He argues that executive agency bureaucrats 
achieved policymaking autonomy from “politicians 
and organized interests” earlier and more extensively 
than many scholars have seen (4). 

Yet Carpenter largely omits from his analysis the 
politics that led to the overwhelming confinement 
of African Americans to lower-level positions in the 
postal service; the failure of the Agriculture Depart- 
ment to confront, and its frequent complicity in, the 
exploitation of black tenant farmers; and the Inte- 
rior Department’s toleration of practices that limited 
African Americans access to public lands while it also 
transferred Indian lands to public and private white 
hands.* Such patterns strongly suggest that these bu- 
reaucrats either were not autonomous from the white 
supremacist order and its efforts to resist racial trans- 
formations or were choosing to assist it. Either way, 
the case for bureaucratic autonomy and its significance 
cannot be made fully unless those issues are addressed. 
This point applies to the whole literature “bringing the 
state back in.” Scholars have failed to reflect on how 
far the internal structures and goals of “autonomous” 
agencies have been constituted by racial orders. 

When considering the efforts of Woodrow Wilson’s 
Postmaster General Albert Burleson to squash union 
organization among postal workers (Carpenter 2001, 
170-71), for example, it would be illuminating to ex- 
plore the degree of autonomy we should attribute 
to Burleson’s role in entrenching segregation among 
postal employees, Was it a “divide and conquer” tactic? 
Was it done at Burleson’s discretion, or in response to 
powerful white supremacist congressional committee 
members, or the ardently segregationist president, or 
other political actors? It would also be valuable to con- 
sider how far the employees thus organized on race 
lines helped secure the “white supremacist” order. The 
roles of Burleson and his agency appear not inconsid- 
erable in these regards. The Postmaster General ad- 
vocated increased segregation to President Wilson’s 
cabinet in April 1913, complaining on behalf of white 
mail clerks: “It is very unpleasant for them to work 
in a car with negroes where it is almost impossible to 
have different drinking vessels and different towels, 
or places to wash.” Burleson then promoted measures 
“to segregate white and negro employees in all De- 
partments of Government” (quoted in Cronon 1963, 


2 Carpenter (2001) makes one bnef reference to discnmmation 
against blacks in the postal service (74). For more on the racial 
patterns of farm and land policies, see Foner 1988, 542, 592-98, Fuchs 
1990, 92-98, and Homie 1984, 44-72 
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32). This policy was adopted throughout the federal 
government (NAACP 1913; Weiss 1969, 64-65). 

If the Postmaster General had autonomy in this 
sphere, then his use of it contributed substantially to 
patterns of segregation throughout the federal gov- 
ernment, in ways that helped define racial practices 
throughout American society. That is a feature of bu- 
reaucratic autonomy that seems worth noting. And if 
these bureaucrats were not autonomous in these re- 
gards, if they were complying with powerful southern 
congressmen, the president, or other officials, then Car- 
penter’s argument is seriously undercut. 

The same questions about the character of govern- 
ment complicity in white supremacist arrangements 
can also be asked of the Department of Agriculture, 
for like the Post Office it was, at the time of seg- 
regation’s spread, an old department that had long 
been engaged with African Americans. We might also 
ask whether segregation in labor unions, specifically, 
should be seen more as a cause of discriminatory gov- 
ernmental employment practices, more as a result, or as 
equally shaped by a broader white supremacist order. 
Conversely, we might ask whether the persistence of 
patronage appointments instead of civil service posi- 
tions in public employment worked to strengthen the 
white supremacist order. But no such questions surface 
in Carpenter’s work, because the relationship of bu- 
reaucratic autonomy to racial hierarchies is not raised. 
Mostly race does not make it into the index, let alone 
the text. Hence his case for bureaucratic autonomy 
and its significance is less compelling than it otherwise 


might be. 


Congress 


The political importance of Congress to the United 
State’s racial orders hardly needs elaboration, so its in- 
teractions with those orders might seem an inescapable 
theme of most congressional studies. Many scholars 
have explored, for example, the antebellum concern to 
maintain structures of power in Congress that would 
protect slavery, and the ways that territorial expansion 
challenged those structures (Weingast 1998). Yet Eric 
Schickler’s Disjointed Pluralism (2001) is a historically 
sweeping and methodologically sophisticated account 
of institutional change within Congress that gives little 
attention to this dimension of congressional! politics. 
Schickler makes a strong case for understanding in- 
stitutional change in Congress as “disjointed plural- 
ism.” He argues that a variety of actors, interests, and 
conditions, rather than any single engine of change, 
have long shaped congressional institutional innova- 
tions. Using careful criteria to select four periods of 
significant change from 1890 to the present, his analysis 
finds enduring tensions in conflicting efforts to central- 
ize power in the parties and institutional leadership, 
among strong committees and their chairs, and in de- 
centralizing initiatives from new members of the House 
or Senate wishing to use their chamber for policy or per- 
sonal entrepreneurship (Schickler 2001, 5-18). There 
is no single direction in which those tensions have 
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been resolved and no single winner in the processes 
of change. 

However, it is striking how Schickler’s approach 
leads him to underplay the powerful racial dynamics 
structuring many of the changes he discusses, as well 
as the consequences for America’s racial ordering that 
each involves. All of his four periods (1890-1910, 1919- 
32, 1937-52, and 1970-89) coincide with racial conflicts 
that were brought directly into congressional politics 
and national policymaking. They were never periph- 
eral and often central to the changes in congressional 
behavior Schickler documents, in terms of both causes 
and effects. 

Schickler’s theoretical framework has room for race 
chiefly as a “policy interest,” which may be “ideo- 
logically” as well as sectionally based, though racial 
considerations also play into individual member “elec- 
toral interests” and “party interests” (Schickler 2001, 
11-12). Though otherwise reasonable, this approach 
has the undesirable effect of fragmenting the “white 
supremacist” order and rendering it invisible. It fails 
to grasp theoretically the reality that individual and 
party ideologies, sectional, and electoral interests were 
significantly constituted in these periods by where in- 
dividuals, parties, and the dominant forces in various 
regions stood on the great questions of whether and 
how America’s racial hierarchies were to be preserved 
or dismantled. Hence it makes interests and actors ap- 
pear more “disjointed” than they often were. 

In Schickler’s first period (1890-1910), patterns of 
speakership and committee powers shifted several 
times in important ways. First, Republican House 
Speaker Thomas Reed succeeded in increasing his of- 
fice’s powers and reducing those of the minority party 
in 1890. Then the Democrats gained a majority at 
the next election and dismantled his rules. Then after 
Republican electoral successes, a bipartisan coalition 
strengthened the Rules Committee to make the insti- 
tution operate effectively again. Among other devel- 
opments, the Speaker’s powers then declined, only to 
be reasserted by Joe Cannon, only to have his powers 
restricted again to the benefit of the Rules Committee 
(Schickler 2001, 28-30). 

In discussing these developments, Schickler does 
note that Republican Party interests led many to sup- 
port a bill to protect African American voting rights in 
1890, but that they were sharply divided, and that this 
was the one element of their legislative package they 
failed to enact. Nonetheless, the Democrats denounced 
this “force bill,” among other centralizing measures, at 
the next election and won the majority that enabled 
them to repeal Reed’s rules (Schickler 2001, 34-35, 42- 
43). We would explain those events as products of the 
internal conflicts and electoral decline of the postwar 
transformative egalitarian order. Schickler leaves them 
unexplained, and does not note their consequence: The 
Democrats went on to repeal the last vestiges of Re- 
construction voting rights protections for blacks and 
the Republicans largely abandoned the cause of black 
voting rights (Kousser 1974, 27-29, 49, 241, 363; Valelly 
1995). The GOP soon regained power but strictly as a 
party of economic interests, not civil rights. 


Its members then cooperated to promote congres- 
sional efficiency on a fairly bipartisan basis—the same 
basis on which the nation moved to rebuild the white 
supremacist order in the form of segregation and race- 
based imperialism. With the not-so-coincidental sanc- 
tion of the Court in Plessy v. Ferguson, segregation laws 
spread, the number of lynchings increased steeply, and 
annexations of Puerto Rico and the Philippines ex- 
tended the American white supremacist order to new 
realms while helping to legitimate it at home. In sum, 
Republican efforts in 1890 to restructure Congress on 
behalf of its partisan interests, then aligned with the 
transformative egalitarian order, collided with the na- 
tion’s white supremacist order in ways that contributed 
greatly to the shifts in congressional structure Schickler 
examines. Those shifts contributed in turn to a reposi- 
tioning of the Republican Party in the nation’s racial 
ordering, and the expansion of the white supremacist 
order, in ways Schickler does not examine. 

The realities of this acquiescence by both parties in 
the Jim Crow system meant that during Schickler’s sec- 
ond period, 1919-32, a key effort of resistance by trans- 
formative egalitarian forces, the vigorous NAACP-led 
campaign to enact antilynching legislation, found only 
a few official champions and no success (Zangrando 
1980). Instead, bipartisan organizational reforms re- 
sulted in fewer but stronger committees, making senior- 
ity in terms of committee service, as well as chamber 
membership, a source of great power (Schickler 2001, 
95-98). Though Republicans introduced these changes, 
much support came from white supremacist south- 
ern Democrats, who virtually always won reelection. 
The increased importance of seniority then embedded 
Congress even more firmly in the white supremacist 
order, so that legislative efforts to transform that order 
went for naught. In this period the rules of cloture and 
filibustering in the Senate were also cast. Both became 
bulwarks of white supremacy, used to dilute the 1957 
civil rights bill and to pose the major hurdle to the 1964 
Civil Rights Act. 

Schickler (2001) notes that in his third period, 1937- 
52, the Rules Committee became “the principal insti- 
tutional power base in the House” for a “cross-party 
conservative coalition” consisting of “Dixiecrats” and 
conservative Republicans and that it enabled “south- 
erners to block legislation that just might have driven 
them from the party” (163-66). He also notes that a 
1949 liberal rebellion against the Rules Committee 
only ended up strengthening the power of commit- 
tee chairs, where southern seniority conferred simi- 
lar advantages (178). Yet he does not observe that 
it was consistently legislation threatening the white 
supremacist order, such as antilynching laws, voting 
bills and other civil rights proposals, and economic 
and welfare legislation that might benefit black farm 
and domestic workers that Dimecrats opposed. Re- 
publican conservatives won their support through ally- 
ing with them in support of white supremacy. Or to 
put it more strongly, because race was its linchpin, 
Schickler’s “conservative coalition” should be seen as 
an institutional component of the white supremacist 
order. 
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Schickler instead follows Bensel in presenting south- 
ern interests simply as collective “sectional interests.” 
Perhaps that vague phrase is appropriate, but if so, it 
needs defense, since it cannot by itself explain the Re- 
publican/Diniecrat alliance. The Republican members 
of the coalition were, after all, not from the southern 
“section.” And “sectional interests” can easily be in- 
terpreted like Bensel in economic terms; yet it is at 
least arguable that southern conservativism on labor 
issues and other economic matters were really largely 
racial concerns (Gerstle 2001, 160-68). But race rarely 
surfaces in Schickler’s account of congressional policy 
preferences, party interests, or congressional structur- 
ing. 

If we believe that southern and conservative interests 
stem from attachments to a racial order that cannot 
be analyzed as simply a class system or a sectional 
system, then more than Schickler acknowledges, his 
history provides evidence not just for “disjointed plu- 
ralism,” but for the special role of conflicting racial or- 
ders in shaping congressional structures during much of 
the twentieth century. A portrait of “racist-constrained 
pluralism” conveys a very different sense of congres- 
sional organization than his more open-ended “dis- 
jointed” pluralism. In sum, in theory Schickler employs 
an approach that fragments what are arguably man- 
ifestations of an interlocked white supremacist order 
into a variety of apparently separate interests, and in 
his historical studies he gives only slight attention to 
the interactions of congressional structuring with the 
nation’s clashing racial structures. The result may be to 
portray the congressional forest without recognizing 
the common seeds of many of its trees. 


Immigration 


In common with too many other policy topics, immigra- 
tion policies are often analyzed separately from domes- 
tic racial issues, Yet from the 1882 Chinese Exclusion 
Act, through the Johnson—Reed 1924 Immigration Act 
establishing race-based national origins criteria, to the 
Immigration and Naturalization Act of 1952 affirming 
racial discrimination, domestic racial institutions and 
their proponents have interacted profoundly with im- 
migration policy (Ngai 1999). For many, Chinese ex- 
clusion fit with displacing western American Indians. 
Constitutional endorsement of segregation sustained 
the legitimacy of racial naturalization restrictions. It is 
doubtful that the prorestriction immigration regime, 
initiated in 1882 and in place until 1965, could have ex- 
isted without a white supremacist alliance in Congress 
of southern Democrats and western Republicans, a 
coalition that provided successive chairs of the two 
houses’ Immigration Committees. They gained further 
reinforcement from northeastern nativist elites. ‘These 
“strange bedfellow” alliances show that the racial 
order promoted linkages across diverse political group- 
ings that, in turn, helped maintain that order. Many 
southern segregationists had previously hoped that im- 
migrants would bolster the southern economy without 
threatening white supremacy; but in the late nineteenth 
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century they concluded that their interests were better 
served by allying with nativist westerners and north- 
easterners, who were not nearly so wedded to domina- 
tion of blacks but who needed votes to win immigration 
bans. 

Because they joined western white workers, south- 
ern white landowners, and old money northeasterners, 
these alliances are not easily explained in class terms of 
employers versus employees or allies of former masters 
versus allies of former slaves. They are best seen as 
components of the resurgent white supremacist order 
of the late nineteenth century. Accordingly, opposition 
came chiefly from the embattled champions of the post- 
war egalitarian racial order like Massachusetts Senator 
George Hoar, whose party, class, racial, and regional 
origins were the same as those of many restrictionists, 
but who refused to join with those northeastern Re- 
publicans who were defecting to support for renewed 
white supremacist arrangements (Gerstle 2001, 14-43; 
Tichenor 2002, 103-8). 

Yet many scholars persist in analyzing immigration 
in nonracial economic, cultural, and institutional terms. 
In Dividing Lines (2002), Daniel Tichenor provides the 
most thorough scholarly narrative of American immi- 
gration policy yet available, and his approach is suitably 
more complex. He explains the major shifts in policy 
in terms of four factors: the fragmentation of the U.S. 
polity that permits unequal patterns of access for lob- 
byists, the tendency for politically unusual coalitions to 
form around immigration policy issues, the influence 
of professional experts’ views in policy choices, and the 
effect of international crises. This framework enables 
a comprehensive account of U.S. immigration policies 
from the late eighteenth century to the present. How- 
ever, because it does not include recognition of racial 
orders, it again fails to highlight how these patterns 
of unequal access, unusual coalitions, expert opinion, 
and responses to international pressures have often 
been products of racial aims, alliances, and institutional 
structures. Thus at times Tichenor does not bring out 
fully the roles that actors within those orders have 
played in shaping and contesting the politics of im- 
migration and the ways in which immigration policies 
have in turn affected America’s domestic racial struc- 
tures. 

For instance, Tichenor (2002) recognizes that 
in adopting Chinese exclusion, post-Reconstruction 
Americans put national political institutions “in service 
to traditions of racial hierarchy.” He also traces that de- 
velopment to a “political alliance of the South and the 
West” dedicated to white supremacy (103-4, 113). But 
though Tichenor observes that “Jim Crowism” spread 
during this period, he stresses the institutional dif- 
ferences between national “acquiescence to state and 
local racial practices” in the case of segregation and 
the active national regulatory role in Chinese exclu- 
sion (113). Rather than highlighting how the southern 
and western alliance linked immigration and segrega- 
tion laws into a more potent white supremacist order, 
this presentation makes these developments appear 
relatively distinct. Similarly, writing about the land- 
mark Dillingham Commission on immigration, which 
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published a 42-volume study in 1911 after four years 
of research, Tichenor notes the Commissioners’ judg- 
ment that African Americans occupied the most un- 
sanitary and overcrowded urban dwellings (128-32). 
He does not, however, push through the implications 
of this comparison. To many the report suggested that 
American blacks were even more undesirable sources 
of disease and degradation than the most objection- 
able immigrants. Thus, though the eugenics-inclined 
Dillingham Commission focused on Eastern European 
immigrants, it too had throw-off effects for the nation’s 
white supremacist order, reinforcing the hierarchies in 
which African Americans fared least well (Ovington 
1911, 171). 

Likewise, Tichenor’s account of the national ori- 
gins system implemented from the late 1920s through 
the mid-1960s highlights the influence of its founders’ 
“xenophobic and racist intentions” (147). But Tichenor 
fails to explore African American political responses 
at the time, articulated at annual NAACP conferences 
and in the Association’s lobbying, or the implications 
for black-white configurations of the law’s racial “archi- 
tecture” (Ngai 1999). The NAACP deplored the system 
of “race classification” embodied in the national origins 
quota system as “naive and untechnical” (3), for good 
reason. That system expressed and reinforced the na- 
tion’s domestic racial order, working with, for example, 
the Supreme Court’s decisions upholding naturaliza- 
tion preferences for whites to maintain the vision of the 
United States as an essentially white country. Some 
white supremacists saw in immigration policy a chance 
to achieve greater national “whitening” and even con- 
templated the removal of all African Americans from 
the United States via either forcible or voluntary emi- 
gration. A bill passed by the Senate in 1914 but rejected 
by the House, after intense NAACP lobbying, sought 
to exclude all black immigrants and to permit the exclu- 
sion of African Americans who traveled abroad (King 
2000, 153-55). 

We also cannot adequately analyze the later over- 
turning of the race-based national origins quota system 
without recognizing the important role played by civil 
rights advocates, again best understood as members of 
the then-ascendant “egalitarian transformative” racial 
order, rather than as a coalition bound strictly by class 
or ethnic allegiances. African American intellectuals 
and leaders argued strongly in 1965 that ending the 
national origins system was a necessary part of ending 
American racial discrimination (Yu 2001). Tichenor 
(2002) notes the NAACP’s pro-immigrant stance and 
includes the “new civil rights zeitgeist” (205, 217) as 
a factor contributing to the 1965 policy shift. But in 
the absence of a racial orders analysis, that zeitgeist 
appears either exogenous to or a minor part of im- 
migration politics, rather than a product of ongoing 
struggles between racial orders in which immigration 
rules had always been central. 

And though the old quota system codified theories 
of racial and ethnic differences more explicitly than 
modern laws do, current policies have fostered dis- 
courses about the “new immigrants,” principally Asian 
Americans and Hispanics, that perpetuate stereotyped 
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notions of racial identities. Contemporary debates re- 
peatedly air implicit or explicit concerns about how 
these groups will fit into or reconfigure America’s 
stiull-contesting racial systems. These, again, are con- 
sequences that, lacking a theoretical focus on how im- 
Migration interacts with racial orders, Tichenor does 
not explore. Claire J. Kim (1999) has argued persua- 
sively that such immigrants often face a challenge of 
“racial triangulation,” in which they feel compelled 
to gain a satisfactory position on a racial field de- 
fined chiefly in terms of black and white identities. 
In our view, such triangulation is both inescapable 
and perilous. It reflects pressures to choose sides be- 
tween the nation’s competing racial orders, with costs 
associated either with choice or with opting for 

“neutrality.” The politics of immigration and new immi- 
grants, like issues of bureaucracy, congressional struc- 
ture, and many other topics, thus can be better grasped 
if we consider the roles of racial orders in shaping 
past and current political institutions, policies, and 
conduct. 


CONCLUSION 


Today, racially inflected contests in courts, legislatures, 
electoral campaigns, and popular discussions over af- 
firmative action; school and residential segregation; 
felon disfranchisement; majority—minority districts; 
racial profiling; the disparate racial impact of incar- 
cerations and the death penalty; hate crimes; repara- 
tions for slavery; Native American rights; immigration 
policies; bilingualism; multiculturalism; “model minor- 
ity” stereotyping; and racial discrimination in hous- 
ing, auto, and credit markets, and in hiring and pro- 
motions, all still roil American political waters. Many 
putatively nonracial issues, such as restraints on free 
speech, vouchers for private schools, the revival of 
federalism, and disputes over public health, environ- 
mental, and social assistance policies, all continue to 
be shaped by race-related struggles. Few of these is- 
sues, or the wider developments with which they are 
linked, can be understood without exploring the en- 
during tensions between and within the nation’s racial 
orders. 

Our argument has not been that race explains every- 
thing in American politics, or even that race is always 
important for every dimension of American political 
development. Many of the apparently nonracial issues 
just listed, and many more, are indeed heavily shaped 
by other concerns. But we maintain that the internal 
dynamics of American racial orders, and their interac- 
tions with each other and with other aspects of Amer- 
ican political life, have so often been so important that 
the question of what role race may be playing should 
always be part of political science inquiries. The failure 
of political scientists to deal adequately with race in 
their scholarship has been all too much a part of the 
failure of Americans to deal adequately with race in 
their common lives. That is why this failure is one that 
our discipline has a special need, and a special duty, to 
rectify. 
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olitical scientists know remarkably little about the extent to which legislatures are able to influence 

policymaking in parliamentary democracies. In this article, we focus on the influence of legislative 

institutions in periods of coalition government. We show that multiparty governments are plagued 
by “agency” problems created by delegation to cabinet ministers that increase in severity on issues that 
divide the coalition. We also argue that the process of legislative review presents an important—but 
understudied—institutional opportunity for coalition partners to overcome these tensions. We evaluate 
our argument using original legislative data on over 300 government bills collected from two parliamen- 
tary democracies. The central implication of our findings is that legislatures play a more important role 
in parliamentary democracies than is usually appreciated by providing a key institutional mechanism 
that allows coalition partners with divergent preferences to govern successfully. 


play a significant role in policymaking? If so, 

are all legislative actors equally able to exercise 
policy influence? In what circumstances will those who 
can affect legislative outcomes actually choose to do 
so? These questions are central to an understanding of 
representative government in parliamentary democ- 
racies. In parliamentary systems, legislatures are typi- 
cally the only political institutions at the national leve] 
that are directly elected by—and thus accountable to— 
citizens. Furthermore, with only rare exceptions, par- 
laments must approve all major policy initiatives. And 
yet, scholars know remarkably little about the extent 
to which legislatures in parliamentary systems matter. 
Instead, political scientists with an interest in policy- 
making in these systems have placed primary empha- 
sis on understanding politics at the government (cab- 
inet) level (see Gamm and Huber 2002, 323). In one 
sense, this focus on governments is natural. The govern- 
ment appears dominant in the policymaking process, 
at least with respect to the introduction of legislation 
(Andeweg and Nijzink 1995, 171). Moreover, since 
elections usually do not yield a legislative majority for a 
single party, political elites must engage in a process of 
complex bargaining that typically results in the forma- 
tion of multiparty coalitions, which must then strive to 
retain the confidence of a majority of legislators for the 
remainder of the parliamentary term. This combina- 
tion of facts readily suggests that to understand policy- 
making in parliamentary systems, one must understand 
above all else the formation and dissolution of coalition 
governments. Thus, over the last four decades, increas- 
ingly sophisticated theoretical and empirical accounts 
of the “birth” and “death” of coalitions have come 
to dominate the literature on parliamentary politics, 
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Importantly, these accounts have largely abstracted 
away from the continuing interactions between coali- 
tion partners in the legislative process.! 

In this article, we argue that a stronger focus on 
legislative politics in parliamentary systems is indis- 
pensable for understanding policymaking in periods 
of coalition government. In particular, we argue that 
multiparty governments make use of legislative review 
procedures to resolve a central tension they face dur- 
ing their tenure—the need for partners with divergent 
preferences to make joint policy. In making our argu- 
ment, we draw on, and contribute to, an extensive liter- 
ature on the delegation problems inherent to modern 
democratic government. While several scholars have 
written on the relationship between citizens and gov- 
ernment officials (e.g., Lupia and McCubbins 2000 and 
Strøm 2000), or between elected officials and bureau- 
crats (Huber and Shipan 2002), our argument focuses 
on agency problems that emerge between parties in 
a coalition. A number of recent studies have focused 
on the same tension (e.g., Müller and Strøm 2000 and 
Thies 2001) and have highlighted the important func- 
tions of cabinet-level institutions in mitigating agency 
problems within a coalition. Our argument differs from 
these studies in a significant way. While we do not deny 
the importance of cabinet-level institutions, we con- 
tend that coalition parties use legislative institutions 
for similar purposes. As a result, one of the central 
conclusions of our study is that legislatures in parlia- 
mentary systems play a much more influential role than 
is typically acknowledged in the comparative litera- 
ture. Importantly, unlike studies that have emphasized 
that legislatures might play a significant role in parlia- 
mentary systems because they strengthen opposition 
parties (e.g., Powell 2000, Saalfeld 2000, and Strgm 
1990), this article focuses on the central role that parlia- 
mentary institutions play in strengthening government 
parties that have been forced to delegate important 
agenda-setting powers to their coalition partners. 

We begin the study with a simple model of the inter- 
actions between coalition partners that demonstrates 


1 Important exceptions to this general trend include the work of 


Carrubba and Volden (2000), Heller (2001), and Huber (1996). 
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that multiparty governments are likely to be plagued 
by “agency” problems created by delegation to cab- 
inet ministers that increase in severity on issues that 
divide the coalition. We argue that the process of leg- 
islative review presents an important—but understud- 
ied—institutional opportunity for coalition partners to 
overcome these tensions. We then provide an empiri- 
cal evaluation of our argument using original data col- 
lected from two parliamentary democracies that details 
how government bills have been altered in the legisla- 
tive process. 


COALITION GOVERNMENT, DELEGATION, 
AND AGENCY PROBLEMS 


A central reason for focusing on the “Life” of a coalition 
between formation and termination is that the ongoing 
policymaking process in coalition governments poses 
challenges that are typically more muted in single-party 
governments. Two features in particular make policy- 
making by coalitions problematic. The first is that coali- 
tion government requires parties with preferences that 
may diverge more or less on specific issues to pursue a 
common policy. As Strøm and Muller (2000) argue, 


Coalition governments manifest some sort of bargain 
struck between their participating parties. Political parties, 
or more specifically their leaders, are likely to have differ- 
ent preferences over a broad range of issues, driven at least 
in part by their motivation to secure for themselves policy 
benefits, office benefits, and electoral advantage. All of 
these goods are likely to be scarce, so that not all demands 
can be satisfied simultaneously.... Electoral advantage is 
even more strictly constant-sum. Ultimately, one party’s 
gain has to be someone else’s loss. In some cases, all in- 
cumbent parties may be able to gain at the expense of the 
opposition, but, much more commonly, coalition parties 
are in some sort of mutual competition for votes. (257) 


Thus, parties that participate in a coalition are en- 
gaged ina “mixed motive” game. On the one hand, they 
have reason to cooperate with their partners to pursue 
successful common policies. On the other hand, each 
party faces strong incentives to move policy in ways that 
appeal to party members and to the constituencies on 
which the party relies for support. Moreover, electoral 
calculations provide reasons to appear to work toward 
providing such policies even when party leaders realize 
that they may not be able to deliver them in practice. 
In short, the policy and position-taking incentives of 
coalition parties continually put them in competition 
with one another. 

Managing the potential conflict engendered by these 
divergent preferences becomes a central problem be- 
cause of a second feature of coalition government— 
delegation. Since the emergence of “modern” par- 
liamentary government, members of the legislature 
have increasingly entrusted substantial authority to the 
prime minister and the cabinet (see Bagehot [1872] 
1978, Bryce 1921, Cox 1987, and Wheare 1963). This 
delegation to the cabinet is most evident in the fact 
that legislative activity in parliamentary systems is 
centered around government bills (see Andeweg and 
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Nijzink 1995, 171, as well as Gamm and Huber 2002, 
323). However, the chain of delegation does not end 
here. While government bills usually require cabinet 
approval, cabinets do not “collectively” draft bills. The 
relatively small size of the cabinet coupled with a high 
collective workload means that the precise content and 
wording of a bill are usually decided by the cabinet 
minister under whose jurisdiction a bill falls (though 
the minister, of course, may work closely with civil 
servants in the ministry). In other words, coalition 
government typically involves delegation that provides 
considerable discretion to ministers in drafting legisla- 
tion (Gallagher, Laver, and Mair 2001, 56; Laver and 
Shepsle 1994, 1996). 

Inherently, for coalition parties that are not affiliated 
with a particular minister, delegation creates risks.” 
Ministers may attempt to make use of their discre- 
tion over initial government bills to move policy in 
directions they favor but that are detrimental to the 
interests of other coalition members. Even where min- 
isters do not expect to be successful in implementing 
a particular draft policy, they may want to use their 
ability to introduce draft legislation to “take a stand” 
that provides electoral benefits to their party (see 
Huber 1996). Other coalition parties, in turn, face 
strong incentives to scrutinize the initial legislative 
proposals made by individual ministers and to coun- 
teract, where necessary, the detrimental effects (from 
their perspective) of ministerial discretion. In other 
words, coalition parties face strong incentives to moni- 
tor, and if necessary to correct, the actions of “hostile” 
ministers.’ The thrust of our argument will be that (un- 
der the right circumstances) legislative scrutiny plays 
a central role in this process. Before turning to the 
particulars of parliamentary scrutiny, we formalize the 
delegation problem confronting coalition governments 
and discuss the circumstances under which ministers 
are likely to (abuse their discretion as well as the 
circumstances under which other members of the coali- 
tion will try to counteract ministerial attempts to move 
policy or to take a position. 


MODELING COALITION POLICYMAKING 


In modeling the “delegation” problem confronting 
coalition governments, we focus on the tension 


2 To isolate the impact of tensions among coalition partners, we 
treat parties as unitary actors and abstract away from delegation 
problems that may exst within a party (e.g, Saalfeld 2000}. Sumilarty, 
we abstract away from principal-agent problems between ministers 
and bureaucrats (see Huber and Shipan 2002). 

3 The problem of delegation posed by munisterial discretion shares 
many features of the conventional “princrpal-agent” framework 
(Kiewiet and McCubbms 1991) that has been used to discuss agency 
problems between citizens and government officials (Lupia and 
McCubbins 2000, Strøm 2000) and between ministers and bureau- 
crats (Huber and Shipan 2002). Our approach differs ın that we 
place primary emphasis not on asymunetric information (which cre- 
ates opportunities for shirking in the traditional framework) but on 
incentives for munistenal position-taking. Nevertheless, the funda- 
mental tension 1s the same. Ministers, acting as agents of the cabinet 
as a whole, may not work fully ın the interests of all of their principals, 
thus creating the need for ex post oversight and correction. 
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created by the need for coalition partners to agree on 
government policy,‘ the policy and position-taking in- 
centives of individual coalition parties, and the need to 
delegate to ministers. Consider a coalition composed 
of two parties (Party A and Party B), confronting a 
one-dimensional policy space.° Assume that Party A’s 
ideal point is located at 0 and Party B’s ideal point 
is located at X > 0 (thus, as X increases, the policy 
divergence between the two parties grows). Because 
they can only adopt one policy on the issue at hand, 
the two parties must agree on a policy to implement. 
Naturally, we would expect the parties to agree on a 
compromise inside the Pareto set (defined by [0, XJ), 
but the precise location of the “coalition compromise” 
will be determined by a bargaining process between 
the two parties. Although clearly central to an under- 
standing of coalition government, we are not directly 
concerned with this bargaining problem here. Rather, 
we are interested in the derived problem posed by min- 
isterial discretion. Therefore, we will assume an exoge- 
nously defined “coalition compromise” in the Pareto 
set, piven by X/t, where t > 1. When t = 1, the com- 
promise position is equal to the ideal policy of Party B. 
As t increases, the compromise position moves to- 
ward the ideal point of Party A, e.g., at t= 2, the 
compromise solution “splits the difference” between 
the parties.® 

The coalition compromise X/t can be interpreted in 
a number of ways. Most obviously, multiparty govern- 
ments usually negotiate detailed coalition agreements 
and government policy declarations at the outset of 
their tenure (Strøm and Müller 2000; see also the es- 
says in Müller and Strøm 2000). The policy positions 
enshrined in these documents can be thought of as 
the “compromise positions” to which the parties have 
agreed and that the coalition has pledged itself to im- 
plement. Alternatively, for issues that are not negoti- 
ated by parties ex ante and that emerge over the life of 
the coalition, we can think of the coalition compromise 
as the policy that would emerge if the coalition parties 
engage in a “full-fledged” bargaining process, making 
use of the resources each party has available in trying to 
shape the final policy outcome. We assume that parties 
have the same expectations about the location of the 
compromise policy. 

The central tension inherent in coalition government 


that we want to explore concerns the implementation 


‘ At the extreme, coalition partners can, of course, choose to leave a 
coalition rather than agree on a common compromise policy (defined 
below). Implicitly, we assume that this “partiapation constraint” us 
met. Allowing exit would not substantially change the predictions of 
the model. 

> Substantively, the focus on one dimension captures the fact that 
policymaking is typically departmentalrzed along jurisdictional Imes 
with separate ministries drafting legislation that addresses specific 
policy issues (see Laver and Shepsle 1996) It also simplifies the 
analysis. However, the argument extends to multidimensional policy 


spaces 

6 For example, if the well-known proportionality norm that appears 
to govern portfolio allocation (Browne and Franklin 1973) also ap- 
plies directly to policy bargaimng, t would equal the reciprocal of 
the seat share contributed to the coalition’s parliamentary weight by 
Party B. 
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of these coalition compromises. This tension arises 
because delegation within the cabinet implies that min- 
isters (aided, of course, by bureaucrats within the re- 
spective ministries) play a leading role in drafting the 
required legislation. Ministers may be tempted to ex- 
ploit their position in the policymaking process to move 
policy away from the agreed-on compromise. Antici- 
pating such “shirking,” other coalition partners in turn 
may have incentives to monitor, and perhaps to coun- 
teract, the work of ministers. In short, delegation to 
ministers creates agency problems for coalition gov- 
ernments that may require institutionalized solutions 
for “holding ministers accountable.” Consider the fol- 
lowing simple game that captures this dynamic: 


1. A minister belonging to Party A introduces a policy 
proposal w e P. 

2. In response to A’s proposal, Party B has two courses 
of action available. It can choose to accept the pro- 
posal, or it can expend resources to insist on the 
coalition compromise X/t and to amend A’s pro- 
posal accordingly. Doing so demands a resource cost 
c> 0. 

3. The final policy that is adopted, p, depends on B’s 
reaction to A’s proposal. If B expends no effort, 
the original proposal is implemented and p = w. If 
B chooses to challenge A’s proposal, the coalition 
compromise is implemented and p = X/t. We as- 
sume that Party A must pay a cost of d> 0 if its 
original proposal is overturned.® 


We assume that parties are motivated by three 
considerations: the policy that is adopted, potential 
position-taking benefits, and the costs that are incurred 
in the policymaking process. Because Party B does not 
draft the original bill, we assume that this party’s payoff 
depends simply on the final policy that is adopted, net 
of any costs that may be expended to challenge and 
amend the minister’s original bill. As is common in 
spatial models, we treat policy preferences as a linear 
function of the Euclidean distance between the ideal 
point of the party (X) and the final policy (p) that is 
adopted:? 


—|X—+|—c if B chooses to rewrite 
ae party A’s bill and p = @, 
P — )-|X—wl if B chooses to accept A’s bill 
and p = w. 


7 Note that the model makes no assumption about the precise ın- 
stitutional structure in which this “amendment process” takes place. 
As we explain below, parties can engage in amendment activities in a 
vanety of institutional settings. The specific focus of our investigaton 
will be the legislative process The parameter c, discussed in more 
detail below, captures resource and opportunity costs incurred in 
challenging the ministerial draft (see also Strøm 2000, 272) 

8 Thus, d captures opportunity costs that have been incurred in draft- 
ing a proposal that ultimately does not become law as well as any 
“audience” costs that the drafting munister/party might suffer by 
appearing unable to implement the onginal version of the bill 

? Euchdean preferences are risk-neutral. However, the results do 
not hinge on this assumption The substance of the results remains 
the same with risk-averse preferences, although the equilibrium con- 
ditions become more complex. 
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Party A, which introduces the original government 
bill, faces a slightly different situation. Like Party B, it 
derives a payoff from the final policy that is adopted, 
but additionally, the act of introducing the initial “gov- 
ernment bill” provides Party A with the opportunity 
to reap position-taking benefits. The “unveiling” of a 
legislative initiative by a minister generally receives 
considerable media attention, and parties can use bills 
they initiated to “claim credit” during election cam- 
paigns. As a result, by introducing a bill that corre- 
sponds closely to its own (and its constituents’) ideal 
policy, Party A can distinguish its proposed policy from 
the coalition compromise and claim credit for attempt- 
ing to pull policy toward the preferred outcome of its 
constituents. We incorporate these added incentives 
for the proposing minister by assuming that there are 
two policy-based components to A’s utility function. 
The first term captures the distance between the final 
policy (p) and Party A’s ideal point, while the second 
term captures the distance between Party A’s ideal 
point and the original proposal (w) made by Party A. 
This term is weighted by the parameter a > 0, which 
captures the position taking benefits that are available 
to the minister’s party. As a increases, Party A is pun- 
ished more severely for making a proposal that deviates 
from its ideal point, and consequently, the importance 
of position-taking increases relative to the final policy 
that is adopted. Finally, we assume that having its draft 
amended by Party B imposes a cost of d > 0 on Party A. 
Thus, Party A’s preferences are given by 


—|2|—a|w|—d_ if B chooses to rewrite A’s 
bill and p = Ž 


t ? 
if B chooses to accept 
party A’s bill and p = w. 


U, = 
4 -0 +a) lw] 


The game has a unique subgame-perfect equilibrium. 
Before presenting the equilibrium strategies, it is useful 
to define two thresholds (the substantive interpretation 
of these thresholds is discussed below). Specifically, the 
“position-taking threshold” is defined by 


$- t(d+ (1 +a)c) 


(1) 
a 
and the “amendment threshold” by 
X 
Ü = — — c. (2) 


t 
The following proposition summarizes the equilib- 
rium strategies: 


Proposition: The following strategy profile constitutes 
a subgame-perfect equilibrium. 


Party A: If © <0,set w = 0. If d > Oand X > Ê, set 
w = 0. If ù > Oand X < X, set w = Ù. 
Party B: If w > w@, accept w. If w < Ñ, challenge and 
amend A's bill 
Consider the intuition underlying this equilibrium. 
Because rewriting the proposal made by Party A is 
costly, Party B is willing to accept some deviation from 
the coalition compromise policy. How much “drift” B 
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is willing to tolerate depends directly on the costli- 
ness of “correcting” the minister’s bill relative to the 
“damage” imposed by ministerial divergence. The 
“amendment threshold” (Ñ) captures this tradeoff. As 
long as Party A’s proposal lies above this threshold, B 
is willing to accept the minister’s draft bill. Note that 
when the policy divergence between the two parties is 
sufficiently modest relative to the cost of amending the 
ministerial proposal, c, the threshold may be negative. 
In this case, Party A can implement its most preferred 
policy without being challenged. That is, the cost of 
challenging the minister may be so large relative to 
policy divisions between the parties that Party B is 
willing to tolerate Party A’s ideal point rather than to 
insist on the coalition compromise of X/t. Thus, we 
observe the kind of ministerial autonomy portrayed by 
the Laver and Shepsle model (1996), in which parties 
are essentially able to dictate policy in the jurisdictions 
under their control. In other words, the Laver and 
Shepsle model, which predicts ministerial autonomy 
for all levels of policy divergence, emerges as a special 
case in our model if c is so large that w < 0 for all 
coalitions. For c small enough that w > 0, however, 
Party B will find it worthwhile to challenge and correct 
the ministerial draft if Party A’s draft deviates “too far” 
from the coalition compromise (i.e., if the draft bill falls 
below the amendment threshold). 

Party A, in turn, faces the following consideration. 
It could draft a bill that does not induce Party B to 
challenge and rewrite the bill by making a proposal 
that only approximates the coalition compromise but 
that is nonetheless acceptable to Party B. Because there 
always exists such a proposal that is closer to A’s ideal 
point than the coalition compromise, Party A is able 
(within limits) to exploit its agenda-setting power to 
bias policy in its favor. In this case, we observe some 
degree of ministerial drift (but not autonomy) and no 
challenges by the coalition partner. 

The drawback of such a proposal is that it may not 
maximize position-taking benefits because the minister 
may have to introduce a draft that deviates from A’s 
ideal point (if # > 0) to avoid a challenge from its 
coalition partner. In such a situation, Party A could 
pursue the alternative strategy of introducing a bill 
that maximizes position-taking benefits (setting w = 0) 
but that will be amended to the coalition compromise 
by Party B. The “position-taking threshold” indicates 
when Party A will choose to pursue this second al- 
ternative. Importantly, Party A will choose to “take a 
position” (thereby triggering an effort by B to revise 
the minister’s draft) when the policy distance between 
the parties grows sufficiently large (i.e., X > £). Thatis, 
substantial policy divergence within the coalition will 
trigger ministerial drift from the coalition compromise 
that is sufficiently large that other coalition members 
will expend effort to reign in ministerial discretion. 


CHALLENGING MINISTERIAL DISCRETION 


The model suggests that whether the tension cre- 
ated by delegation to cabinet ministers becomes 
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“problematic” for multiparty governments is largely a 
function of policy divisions within the coalition. If the 
coalition is sufficiently divided, ministers face strong 
incentives to deviate from coalition compromises to 
“take a position,” triggering attempts by other coali- 
tion members to challenge and amend draft legislation 
to bring it in line with the coalition compromise. In 
the next section, we present a systematic test of this 
prediction. First, however, we need to unpack the no- 
tion of “challenging and amending” ministerial drafts. 
‘The claim that coalition partners possess resources that 
allow them to challenge and amend the draft legislation 
introduced by ministers is central to the analysis. But 
what are these resources? What avenues can coalition 
parties pursue in counteracting ministerial discretion? 

A prominent solution that has recently received 
some attention from scholars focuses on institutional 
mechanisms that operate at the cabinet level. For ex- 
ample, Thies (2001) has argued that coalition partners 
can make use of “junior ministers” (JMs) to shadow 
ministers belonging to other parties. Similarly, the es- 
says in Miiller and Strgm 2000 provide some detail on 
the use of JMs, as well as the use of cabinet committees 
and “inner cabinets” to manage the potential conflict 
created by the combination of ministerial discretion 
and divergent coalition preferences. The model devel- 
oped above is directly relevant for such cabinet-level 
institutions and suggests that these mechanisms will be 
activated more frequently when the policy divergence 
between coalition partners is large. 

While we believe that cabinet-level institutions may 
play an important role in managing intracoalition con- 
flict, we also contend that there is a second institutional 
arena for resolving conflict within multiparty govern- 
ments that has received surprisingly little attention 
from scholars. In particular, we focus on the role of 
institutions at the legislative level that provide coalition 
partners the opportunity to challenge and amend min- 
isterial policy proposals. Several reasons suggest that a 
stronger focus on legislative institutions is warranted. 
One is connected to the size of the cabinet. The same 
time and resource constraints confronting the cabinet 
that make delegation to individual ministers necessary 
in the first place also hinder the ability of ministers to 
monitor and challenge one another directly. As Thies 
(2001) points out, for example, smaller parties are often 
constrained in their ability to use JMs as an effective 
monitoring tool because they have only a small number 
of JMs to assign (589). As a result, the costs of trying 
to monitor hostile ministers from within the cabinet (c) 
may be so high that coalition partners are effectively 
forced to rely on mechanisms outside the cabinet to 
control ministerial discretion. A second reason derives 
from the formal rules regarding policymaking in par- 
liamentary systems. All major legislation requires the 
explicit assent of parliament. Thus, while cabinet-level 
institutions might be used to amend draft bills, these 
institutions cannot ensure passage of the revised bill 
until the final parliamentary vote has occurred. Where 
coalition parties feel the need to counteract “minis- 
terial drift,” they may therefore choose to do so dur- 
ing the legislative process. Indeed, as the descriptive 
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statistics in the next section will show, government bills 
typically undergo a substantial amount of change in the 
course of legislative review. 

Perhaps the most important reason to focus on the 
use of legislative scrutiny to counteract ministerial dis- 
cretion is that parliaments often provide considerable 
opportunities for this purpose. In particular, many par- 
liamentary systems with proportional representation 
rules and coalition governments feature systems of 
strong standing committees that closely correspond to 
ministerial jurisdictions.’ Such systems provide oppor- 
tunities for committee members to acquire the pol- 
icy expertise necessary to revise complex legislation. 
These committees typically have broad powers, includ- 
ing the right to schedule hearings, call witnesses, sub- 
poena relevant documents, and the ability to propose 
amendments. As a number of scholars have stressed, 
these institutional provisions provide one important 
avenue for opposition parties to influence policymak- 
ing (Powell 2000, 32; Saalfeld 2000, 365; Strøm 1990, 
71). Another—much less emphasized—aspect of these 
institutional structures is that they provide a tool that 
parties in a governing coalition can employ to manage 
the risks posed by ministerial discretion. Parties can 
rely on members of their legislative faction to investi- 
gate the anticipated consequences of a proposed bill, to 
study the justifications offered by the drafting minister, 
to consider available alternative policies, and to draft 
amendments to ministerial drafts. Moreover, the work 
of committees typically proceeds behind closed doors, 
further reducing the cost of challenging and amending 
ministerial drafts from within the coalition. In other 
words, parliamentary scrutiny can play a central role in 
allowing coalition members to counteract the poten- 
tial problem posed by ministerial discretion. Anecdo- 
tal evidence suggests that this is indeed the case. For 
example, Paterson and Southern (1991) have observed 
with respect to Germany that “parliamentary groups 
of the coalition parties do not accept uncritically gov- 
ernment proposals, particularly when such proposals 
derive from ministries held by another party to the 
coalition” (120). Naturally, the institutional structure 
of the legislative process may have significant implica- 
tions for the efficacy of this mechanism in “correcting” 
ministerial drafts (i.e., the parameter c is a function of 
these institutions). 

In summary, our analysis of the central tension in 
coalition government between the agreement to pur- 
sue a compromise policy and the desire (and, if left 
unchecked, the opportunity) of individual ministers 
to move policy in directions favored by their respec- 
tive parties suggests that coalition members will ac- 
tively monitor, and if necessary correct, the policies 
proposed by their partners. Moreover, our model sug- 
gests that the incidence of amendments to ministerial 


10 For example, see Table 2.2 ın Powell 2000, 34. The table strikingly 
reveals the powerful correlation between strong committee systems 
and parliamentary systems that operate under PR and typically pro- 
duce coaliton governments The weakest committee systems over- 
whelmingly occur m majoritarian systems that largely rely on single- 
party government 
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drafts should become more pronounced as the level 
of policy disagreement between parties increases. As 
other scholars have emphasized, such correction may 
be achieved through cabinet-level institutions. Signif- 
icantly, we argue that it may also occur during the 
legislative process. In the remainder of the article, 
we subject this hypothesis to systematic cross-national 
analysis. 


THE DATA 


Testing our argument requires an empirical analysis 
of the changes made to ministerial draft bills in the 
course of parliamentary review. To this end, we have 
constructed an original dataset that details all the mod- 
ifications made to government-sponsored legislation 
between introduction and final resolution in Germany 
(1983-94) and the Netherlands (1982-94).!! The main 
advantage of focusing on these countries is that they 
are similar in terms of a number of coalition-specific 
features and institutional rules that, in addition to the 
policy differences between coalition parties, may be 
relevant in explaining the alterations made to govern- 
ment bills. For example, during our time frame, both 
countries were governed by coalitions that controlled 
a majority of legislative seats and contained no sur- 
plus members.” Both countries are nonmajoritarian 
(or “consensus”) democracies with proportional elec- 
toral systems and multiple parties that share a num- 
ber of legislative institutions that strengthen the role 
of legislators in policymaking. For example, both the 
Bundestag and the Tweede Kamer feature a large num- 
ber of specialized committees (on which seats are 
proportionately distributed among parties) with ju- 
risdictions that “shadow” those of government min- 
istries (Inter-Parliamentary Union 1986). The agenda 
for committee deliberation and plenary debate is de- 
termined by the chamber itself (the Netherlands) or 
by a committee of the chamber in which parties are 
proportionally represented (the “Council of Elders” in 
Germany) (Doring 2001). Further, in neither country 
are committee powers restricted by a prior decision of 
the plenary on the bill’s general principles (Mattson 
and Strøm 1995). Finally, at the final voting stage on 
ordinary legislation, the lower houses in both coun- 
tries employ simple majority decision rules, have the 
same quorum requirements, and use identical voting 
methods (a “successive” procedure) on bills and any 
proposed amendments (Rasch 2000)."° 

It is also worth noting that with respect to each of 
these institutional features, the plurality of parliaments 





11 We restrict our empirical analysis in this study to the changes 
made to bills in the lower chamber of parlament, as this is the arena 
m which most coalition bargaining takes place. 

12 The three governments m the Netherlands were composed of the 
CDA and VVD (1982-86, 1986-89) and the CDA and PvdA (1989- 
94) The three governments in Germany were all coalitions of the 
CDU/CSU and FDP (1983-87, 1987-90, 1991-94). 

13 One potentially relevant procedural difference between the leg- 
islatures in Germany and the Netherlands concerns the i 
authonty of committees We discuss this issue in more detail later m 
this section 
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among European democracies fall into the same gen- 
eral category as Germany and the Netherlands. More- 
over, most European democracies in the postwar pe- 
riod, just as Germany and the Netherlands, have been 
governed by minimal winning coalitions (Woldendorp, 
Keman, and Budge 2000). We therefore believe that 
these two countries are fairly representative of other 
European parliamentary democracies in a number of 
important ways. At the same time, the obvious draw- 
back of our “control by design” strategy is that we 
will not be able at this juncture to assess the impact 
of alternative institutional arrangements and coalition 
attributes on the extent of bill modification. We return 
to this matter in the conclusion. 

Our design also has the advantage of extending over 
a relatively small number of years, making our inves- 
tigation less prone to unmeasured temporal effects. 
Moreover, the choice of time frame allows us to use 
a fairly recent expert survey conducted by Laver and 
Hunt (1992), which provides extensive information on 
both the policy positions and the saliency weights for a 
large number of parties on a variety of issues. More 
specifically, each country expert contributing to the 
study located parties in his or her respective country on 
a 20-point scale in eight distinct policy areas. They also 
assessed the saliency that parties assign to each policy 
area, also on a 20-point scale.'* 

We selected our sample of government bills by start- 
ing with the full set of bills introduced in our time 
frame and then eliminating certain types of bills from 
consideration. Specifically, we excluded budget bills, 
budgetary adjustment bills, and bills proposing amend- 
ments to the constitution because the rules of proce- 
dure associated with them are different from those as- 
sociated with ordinary legislation (Inter-Parliamentary 
Union 1986). Of the remaining bills, we excluded any 
that we could not reasonably classify into the eight 
policy areas from the Laver and Hunt study. The final 
dataset consists of 336 bills, each of which is classified 
into one (and only one) of the eight issue categories. 
(Examples of the types of bills coded into these cate- 
gories are given in the Appendix.) 

The thrust of our argument is that delegation to 
cabinet ministers poses a principal-agent problem for 
coalition governments, that this problem is particularly 


14 Like Muller and Strøm (2000), we consider the Laver and Hunt 
study “the most suitable exemplar of expert judgments” (9). The sur- 
vey 1s comprehensive and methodologically ngorous. Although one 
drawback of expert surveys is that judgments may be “contaminated” 
from observing party behavior, we view this as less problematic in 
our case due to the relative lack of information available on changes 
made to ministerial policy proposals in the legislative process More- 
over, the main alternative to expert surveys, the content analysis of 
party manifestos conducted by the Comparative Manifestos Project 
(CMP), has several problems that scholars have uncreasingly come 
to view as troublesome. These include the lack of consensus over 
how to generate policy positions from data that were originally in- 
tended to measure issue saliency (Janda et al 1995), the “ambiguous 
methodological status of the coding scheme” used to place manifesto 
quasi-sentences into policy categones (Laver and Hunt 1992, 31), and 
the substantial amount of intercoder unreliability (Volkens 2001) 

Also problematic for our purposes ıs that CMP researchers typically 
excluded parties that they viewed as “noncoalitionable,” including 
all far-left parties and far-right religious parties in the Netherlands. 
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TABLE 1. 


Descriptive Statistics 
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(a) Dependent Variable: Number of Article Changes in Government Bills 


Percentile 


10 


70 


100 


Mean 
3.82 


Number of Article Changes 


Vanlance N 
44.60 338 


(b) Independent Varlables* 


Independent Vanable 
Govemment Issue Divisrveness 
Opposition Issue Divisiveness 
Junlor Minister from Partner Party 
Number of Committee Referrals 
Number of Articles In Draft BII! (Logged) 


lration of Bill Before PI Vote 


Mean SD 
1.83 1.69 
6.98 2.84 
0.55 0.50 
2.53 2.26 
1.67 0.85 
0.08 0.27 


“Descriptive statstics for country and issue indicators are grven in the Appendix 


pronounced on issues that divide the coalition inter- 
nally, and that parties will take advantage of opportu- 
nities presented by the legislative process to counteract 
the negative effects of delegation to ministers. More- 
over, Our argument implies that we should see more 
such “corrective” activity on bills that divide the coali- 
tion. As a result, our empirical analysis focuses on the 
degree to which ministerial drafts of government bills 
are altered in the course of legislative review. Develop- 
ing a measure of how bills have changed thus requires 
a comparison of the final version of the bill adopted by 
the legislature to the version of the bill drafted by the 
proposing minister. 

Ideally, a summary measure would allow us to assess 
fully the substantive policy impact of changes made 
to a draft bill. To make such an assessment, however, 
we face a rather thorny issue. Specifically, given the 
technical nature of most modern legislation, grasping 
the policy significance of changes to a draft bill by clas- 
sifying the substantive content and language of such 
changes requires extensive expertise in the policy areas 
dealt with by the bill. Of course, the issue at the heart 
of the delegation problem in modern parliamentary 
systems is that a high level of expertise across multi- 
ple issue areas is extremely hard to come by, even for 
seasoned legislators and cabinet ministers. Naturally, 
the situation is even worse for political scientists, and 
any measure based on our perceptions of substantive 
policy impact is therefore bound to be highly unreli- 
able, especially when applied to a large number of bills 
across a variety of policy areas. 

Given this, we focus instead on developing a more 
objective measure of change, which we define as the 
number of article changes made to the draft version of 
a government bill. Specifically, the measure we will be 
using as our dependent variable is defined as Nurnber 
of articles altered (or deleted) in the draft version of 





bill + Number of new articles added to the draft ver- 
sion of bill.” The dependent variable is thus an integer 
that is bounded from below by zero (a case in which 
a draft bill is not altered at all in the legislative pro- 
cess) and unbounded from above. This measure has 
several virtues. Most importantly, it should tap into the 
substantive impact of the changes made to a bill. Bills 
are normally subdivided into numerous articles, and 
each of these articles addresses a different substantive 
aspect of the bill. As a result, we believe that articles can 
justifiably be treated as the logical “policy subunits” of 
a bill. This belief is reinforced by the fact that final 
voting on the text of bills in parliamentary systems is 
normally conducted on an article-by-article basis (see 
Rasch 2000 for a review of voting procedures in sev- 
eral European legislatures). The second advantage of 
this measure is that it can be reliably coded by other 
researchers. 

In Table 1a, we present descriptive statistics for this 
measure for our sample of government bills. We first 
note that a substantial portion of bills (approximately 
30% of the sample) emerges from the legislative pro- 
cess completely unchanged. The remaining 70% of 
bills, however, are altered to varying degrees before 
they are passed into law. Specifically, half the bills in 
the sample receive more than two article changes in the 
legislature, and on average, there are approximately 
four article changes per bill. We view this as a substan- 
tial degree of change given that the average draft bill 
in the sample is about eight articles long. 


D In constructing this measure, we did not count any minor copy- 
editing changes made to a bill, such as corrections of spelling, capi- 
talryation, and punctuation. 

16 Being unbounded from above, this measure allows us to distmn- 
gush between a draft bill in which every article 1s amended and 
a draft bill in which every article is amended and extra articles are 
attached, which we would view as a more significant degree of change. 
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We now turn to a discussion of how we constructed 
the key explanatory variable, as well as the set of con- 
trol variables, that will be used in our empirical analysis 
of bill change.!’ (The descriptive statistics for these 
variables are given in Table 1b.) Our central theoretical 
claim is that as the coalition becomes more divided on 
policy, coalition partners are more likely to make use 
of the legislative process to “correct” government bills. 
To evaluate this claim empirically, we draw upon the 
Laver-Hunt expert survey to develop a measure of 
the degree of policy divergence within the coalition. In 
constructing this measure, which we term government 
issue divisiveness, we first used the 20-point Laver- 
Hunt position scores to calculate the absolute policy 
distance (in the relevant issue area) between the part- 
ner party in the coalition and the party that controls the 
ministry responsible for drafting the bill. (To account 
for the possibility that larger parties may be in a better 
position to initiate changes because they control more 
legislative resources, e.g., more committee assignments 
and votes in the chamber, this distance is weighted by 
the proportion of legislative seats controlled by the 
partner.) Second, we weighted this distance measure 
by a measure of the relative saliency coalition parties 
assign to the issues being dealt with by the bill.18 The 
purpose of this final weighting is to account for the 
possibility that whether a given bill is sufficiently di- 
visive to prompt parties to engage in position-taking 
and amendment activity may depend on the degree to 
which parties care about the issues under considera- 
tion. 

We also take into account a number of other van- 
ables we believe should be relevant in explaining the 
changes made to government bills. First, it is possible 
that the divisions that exist between the party of the 
proposing minister and its coalition partners also exist 
between the minister and parties in the opposition (i.e., 
the minister’s party could hold an extreme position 
with respect to all parties in the legislature on the is- 
sues at hand, not just the parties in the coalition). To 


17 Tn addition to the variables discussed here, the event count mod- 
eling techniques we use to conduct the analyms, discussed in more 
detail in the next section, provide estimates for two parameters that 
concern the rate at which events (ın our case, article changes) occur 
over the course of an observation period. The first of these rep- 
resents the amount of “exposure” (i.e., the interval of time) over 
which events are accumulated and counted. Including an additional 
parameter in the model for the exposure variable and estimating it 
like any other parameter is seen as the most general means of con- 
trolling for vanability in time intervals (Maddala 1983, 53; see also 
King 1989, 124-26) The second parameter represents the degree of 
overdispersion in the data, which occurs if the article changes made 
to a bill over the course of the legislative process are not independent. 
As these variables are not of primary substantive interest to us, we 
only briefly address their effects in our discussion of the analysis and 
findings 

18 Our measurement of government issue importance (saliency) was 
computed by first scalmg the Laver-Hunt saliency scores so that an 
issue of average saliency for any government party receives a score of 
one. We then calculated a coalition-specific relative saliency score on 
each issue by summung these (weighted) party-specific scores across 
the parties in government. Our government saliency measure has 
the property that a score greater than one indicates a relatively more 
salient issue for the coalition, while a scaled score less than one 
mdicates a relatively less salient issue 
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isolate the effect of intracoalition policy differences on 
changes to government bills, we must therefore also 
consider the policy preferences of the opposition. This 
also allows us to assess the empirical strength of pre- 
vious comparative research that has highlighted the 
opportunities legislative institutions provide to oppo- 
sition parties for exerting influence over policymaking 
(Doring 1995; Müller and Strøm 2000; Powell 2000; 
Strøm 1990). To this end, we have constructed a vari- 
able (developed along the same lines as the govern- 
ment divisiveness variable) that measures the policy 
distance between parties in the opposition and the min- 
ister responsible for initiating the bill (opposition issue 
divisiveness). If the opposition does possess significant 
influence over policymaking, we should expect to see 
more changes being made to those bills dealing with 
issues that greatly divide opposition parties from the 
party of the minister proposing them. 

We also include an indicator for whether the ministry 
from which a draft bill is initiated contains a jumor min- 
ister from the partner party of the proposing minister. 
Recall that, although we expect the legislature to be 
a central arena in which coalition parties amend pro- 
posals originating from ministries controlled by their 
partners, it is likely that changes in proposals are also 
made at an earlier stage, such as in meetings of the 
cabinet or of the relevant cabinet committees, or in 
the originating government department at the earliest 
drafting stages. While we cannot directly observe this 
presubmission process, we would expect a greater de- 
gree of amendment activity to occur at these stages, all 
else equal, the greater the level of information ava- 
able to coalition partners concerning the actions of the 
originating minister and any feasible alternatives to the 
policies being proposed in the draft. As Thies (2001) 
and several contributors to the Müller and Strøm 2000 
volume have pointed out, appointing junior ministers 
toa “hostile” ministry is acommon means employed by 
coalition parties to gather such information and “keep 
tabs” on their partners. To the extent that junior minis- 
ters serve this purpose, we should expect that ministe- 
rial deviations from the coalition compromise are more 
likely to be “caught early,” and that accordingly there 
will be less need for change at the legislative stage. In 
other words, we expect the presence of a partner party 
in the ministry drafting a bill to reduce the number of 
changes made to the bill in the legislature. 

Another control variable we incorporate into the 
model] is the number of committees to which a govern- 
ment bill is referred. It is quite common for bills to 
be referred to more than one legislative committee. In 
our sample, between two and three committees, on av- 
erage, are assigned the task of reviewing a government 
bill. In general, we expect to see more changes made 
to bills, other things equal, the greater the number of 
committees involved in the review process. That is, the 
greater the number of legislators from a party who are 
able to scrutinize a minister’s proposal, the greater the 
likelihood that deviations from the coalition agreement 
can be caught and corrected. 

We include two other variables that take account of 
the amount of material and time legislators have to 
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work with when they are reviewing a government bill. 
The first of these variables, the number of articles in the 
draft version (which we log in the statistical analysis), is 
meant to account for the possibility that draft bills with 
many articles are likely to have more articles changed 
than draft bills with few articles. Even though our mea- 
sure of the dependent variable allows article changes 
to exceed the number of articles in the original bill, we 
believe it is probably easier for legislators to rewrite an 
existing article than to add a new one. Thus, we would 
expect a positive relationship between the number of 
draft articles and the number of articles changed. Sec- 
ond, we include an indicator for the expiration of a 
bill before the plenary vote. For our purposes, a bill is 
defined as expired if it does not reach the final voting 
stage in the lower house by the time the governing 
coalition leaves office. Naturally, we expect to find that 
such bills have not been changed as extensively as bills 
that received a full review in the legislative process.’” 

As discussed earlier, we have focused our attention 
on Germany and the Netherlands because of their simi- 
larities in a number of important respects. That said, we 
recognize that these two countries (or any two coun- 
tries, for that matter) will not be completely alike in 
terms of all the details of the review procedures used 
to scrutinize legislation. For example, one potentially 
important difference in the legislative processes is that 
in Germany, a bill may be altered by a majority of 
committee members before it is sent to the floor, while 
in the Netherlands, committee members can only offer 
amendments to bills in their deliberations, which are 
then subject to a majority vote of all legislators at the 
plenary stage. Because this difference allows correction 
of ministerial drafts “behind closed doors” in Germany 
(which would imply a lower cost of revision), we might 
expect to see more extensive changes being made in 
the German case. Naturally, having just two countries 
in the study means that we can address such differences 
only in a limited, way—namely, by including a country 
indicator variable in the analysis—and that we must 
be cautious about drawing inferences concerning the 
effects of different legislative procedures solely on the 
basis of this indicator.” 

Finally, we include separate indicators for the par- 
ticular issue area addressed by a bill in recognition of 
the possibility that bills dealing with certain types of 
issues are always changed more or less extensively, re- 
gardless of the policy preferences of coalition partners. 


19 There are 26 such bills in our sample. In these cases, since we have 
no final version of the bill to use in our comparison, we count the 
number of article changes in the bill that had been adopted at the 
committee stage. Dropping these bills entirely from the sample does 
not change our findings in any appreciable way 

Importantly, we would not expect the presence or absence of 
committee redrafting authority to alter the nature of the relation- 
ship between bill change and coalition policy divergence. That is, 
regardless of whether the changes made to bills occur prmarily at 
the committee stage or at the floor stage, we would expect to see a 
greater number of them on more drvisive legislation Nonetheless, 
as a robustness check, we also investigate in the empirical analysis 
whether “depooling” bills in the two systems has any mmpact on our 
conclusions. 
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This could occur for a variety of reasons. For exam- 
ple, one possibility is that some types of issues are in- 


-~ herently more complex than others, thereby requiring 


greater informational resources from legislators to ini- 
tiate changes. Another possibility is that outside lobby 
groups or advisory bodies are better organized in some 
policy areas than in others, and consequently, the in- 
formation provided to legislators by these groups will 
make proposing feasible changes to government bills 
more or less difficult to achieve. 


ANALYSIS AND FINDINGS 


We use an event count model (specifically, the negative 
binomial) to assess the empirical validity of our argu- 
ment. Event count models have become the standard 
approach in political science applications where the 
question of interest concerns the number of events that 
occur over a particular period of time, such as in the 
present case, where we focus on the number of article 
changes made to a bill over the course of the legislative 
process. In most event count formulations, the system- 
atic component of the parameter A,, which is defined as 
the rate of event occurrence for an observation period 
i, is expressed as an exponential function of a set of co- 
variates in X. Thus, for a random event count Y,, the rate 
of event occurrence is defined simply as A, = e*'. The 
stochastic component of A, is most often assumed to 
follow the Poisson distribution, which implies that the 
events accumulating during the observation period are 
conditionally independent and that the rate of event oc- 
currence is homogeneous across any given time period. 
These are clearly strong assumptions that are probably 
violated in our case. In particular, we expect that once 
party groups have expended the legislative resources 
to make one substantive change to a government draft 
bul, it is marginally less costly to make several more 
changes. This is known as positive contagion, which re- 
sults in overdispersion in the random event count, Y,.?! 
The general solution for dealing with overdispersion 
involves the use of a distribution known as the negative 
binomial, which allows A, to vary across an observation 
period (see King 1989) and which allows for estimation 
of the degree of overdispersion as a parameter from the 
data. 

In Table 2, we present the results of the negative 
binomial analysis. In column 2, we present the para- 
meter estimates for our independent variables, expres- 
sed in terms of their relationship to the number of 
article changes.” A positive coefficient implies that 


71 That is, the variance of Y, will be greater than its mean. The Poisson 


distnibution, however, restricts the vanance to equal the mean 
(equicispersion) The costs of incorrectly assuming equidispersion 
are standard errors that are generally biased downward (King 1989). 
2 The coefficient estimates for the issue-specific intercepts and the 
exposure and overdispersion parameters are not displayed (but are 
available upon request). Most significantly, these results show that 
the dispersion parameter from the negative binomial model (with a 
coefficient estimate of 0 41 and a standard error of 0.07) 18 statistically 
different from zero (p < .001) This implies that we can safely reject 
the null (Poisson) assumption of independence and homogeneity ın 
the article changes made to a bill over the penod of legislative review. 
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TABLE 2. Negative Binomlal Model of the Number of Article Changes in 


Government Bills 


Coefficlent 


Percentage Change In the 
Expected Number of 


Independent Varlable Estimate* Article Changes? 


Govemment Issue Divistveness 
Opposttion Issue Divistveness 

Junlor Minister from Partner Party 
Number of Committee Referrals 
Number of Articles In Draft BII! (Logged) 
Explration of Bill Before Plenary Vote 


Germany 


29.8" 

—5.2 
—26.5* 

20.7" 
114.2" 
—51.8 * 


58 9™ 


Note’ *p < 05, “p < 01 (one-tailed). Logdikellhood = —660 72. x? (15) = 288 75 (p < 001, two- 


tailed). N = 336 


a Cell ontries in this column are unstandardized maximutmikelinood estimates (with standard errors In 
parentheses). Estimates for the issue-specrfic intercepts and the exposure and dispersion parameters 


are not displayed. 


b Cell entries in this column represent the percentage 


in the number of article changes in a 


change 
bill resulting from an Increase of one standard deviation in the corresponding independent varlable 


(or one unit for dichotomous indicator vartables) 


an increase in the level of the associated independent 
variable will increase the number of articles in a bill 
that are changed in the legislative process, while a neg- 
ative coefficient implies a decrease in the number of 
article changes. As with most other nonlinear models, 
the coefficient estimates from an event count model 
do not lend themselves to a particularly intuitive in- 
terpretation. To assess the substantive significance of 
our findings, we therefore focus on two quantities that 
are readily interpretable. First, since the negative bi- 
nomial is a log-linear model, it permits an odds ratio 
(or relative risk) interpretation of the estimated coeffi- 
cients. Accordingly, in the third column in the table, we 
re-express the parameter estimates for all variables in 
terms of the percentage change in the expected num- 
ber of article changes in a bill, which is calculated as 
[100(e%C+4) + ef)]/e for a change of ô units in the 
associated independent variable, x. (We set 8 equal to 
one standard deviation for continuous variables and 
one unit for dichotomous indicator variables.) Follow- 
ing our discussion of these effects, we then show how 
a change in our main variable of interest, government 
issue divisiveness, affects the predicted number of arti- 
cle changes made to the typical government bill in our 
sample. 

One of the first messages we draw from the results 
in Table 2 is that policy divisions between coalition 
partners have the expected impact on the number of 
article changes made to government bills. The findings 
are consistent with our argument that bills dealing with 
issues that are more divisive for the government are 
changed more extensively in the legislative process. 
Specifically, from the third column in the table, we see 
that over any given period of legislative review, a one— 
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standard deviation increase in issue divisiveness for 
the coalition increases the expected number of article 
changes in a bill by approximately 30%.” 

In contrast, we find no evidence that opposition is- 
sue divisiveness has an impact on changes in govern- 
ment bills. This null finding is at odds with much of 
the conventional wisdom in comparative research that 
portrays legislatures in these “consensus” systems as 
arenas in which opposition parties can exert real influ- 
ence in policymaking. If opposition parties in fact have 
the power to influence policy in meaningful ways, then 
we should expect to see them use it to a greater degree 
on those policy proposals that are presumed to deviate 
more widely from their preferred policy outcomes. Our 
finding that they are not, in fact, able to effectuate 
preater change in conflictual policies is all the more 
striking when placed in the context of the descriptive 
statistics presented in Table 1, which show that, on av- 
erage, the issue divisions between opposition parties 
and the party of the proposing minister are almost four 
times the size of issue divisions within the cabinet. 

Significantly, we also find that bills drafted in govern- 
ment departments where the proposing minister and 
junior minister come from different parties are changed 
less extensively in the legislative process than bills 
coming from departments having no coalition partner 
presence. Specifically, we see about a 27% reduction 
in the number of article changes in bills that come 
from ministries with this form of “power-sharing” at 
the top. This result fits nicely with the logic laid out by 


B Tn ancillary analysis, we strip the coalition divergence variable of 
its saliency and seat weights, but we find that this makes virtually no 
difference in the results reported in Table 2 
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Thies (2001), who suggests that when coalition parties 
succeed in placing their own members in a position to 
serve as junior ministers in “hostile” ministries they are 
able to keep tabs on one another more effectively. 

Another notable finding involves the number of 
committee referrals. We find that bills are changed 
more extensively in cases where several committees 
are reviewing a bill simultaneously. Specifically, for ev- 
ery two additional committees assigned to review a 
bill, the number of articles expected to be changed in 
the legislative process goes up by approximately 20%. 
The results also show that bills with more articles in the 
original draft ultimately experience a greater number 
of article changes than bills with few articles. More- 
over, we see that bills that do not go through a full 
review in the legislative process have about half the 
number of article changes as bills that make it to the 
final voting stage. Finally, Table 2 shows that bills in 
Germany tend to experience more article changes than 
bills in the Netherlands. Earlier, we highlighted the 
redrafting authority of legislative committees in these 
countries as probably the most significant difference 
between their review procedures in the lower cham- 
ber. We stress again, however, that we must be care- 
ful in drawing inferences about the effects of specific 
procedural differences based on the results of a lone 
country indicator. For now, we simply note that our 
findings regarding the effects of coalition policy diver- 
gence hold even while taking country differences into 
account, and that further cross-national work needs to 
be done before we can draw valid conclusions about the 
role of legislative procedures in facilitating changes to 
government bills. 

We now return to considering the effect of our 
main independent variable of interest by investigating 
how different levels of coalition policy divergence af- 
fect the predicted number of article changes in gov- 
ernment bills. We do this through the use of statisti- 
cal simulations using the CLARIFY software (Tomz, 
Wittenberg, and King 2003). First, we simulate the 
model parameters based on the point estimates and 
variance—covariance matrix from the model in Table 2. 
We repeat this procedure 1,000 times. Then we use 


M Using the fractional pooling techniques suggested by Bartels 
(1996), we also check the robustness of the findings to our choice 
to pool bills across these two systems. In fractional pooling, each ob- 
servation 1n a (country) subsample is weighted by a pooltng fraction, 
y, which reflects the degree to which one believes the observation 
should be discounted Using multiple values of y, we perform tests 
of parameter equality where the null hypothesis is that the frac- 
tionally pooled coefficient estimates are equal to the fully pooled 
estimates (y = 1) shown in Table 2. Fortunately, the results from 
this exercise show that even when y = 0 (i.e, when we run separate 
country-by-country regressions), the coefficient on government issue 
divisiveness in each country rs positrve (0.23 ın the Netherlands and 
0.44 in Germany) and statistically significant, and in no case are we 
able to reject the null hypothesis of parameter equality, with one 
interesting exception. Specifically, when we set y = 0, the coefficient 
on opposition issue divisiveness for German bills, which is negative 
and statistically different from zero, becomes statistically different 
from the slightly negative (but statistically insignificant) coeffiaent 
from the fully pooled model. Thus, m Germany, it appears that bills 
that separate opposition parties from the minister are ciscernibly 
less likely to be changed extensively in the legislative process 
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these sets of simulated parameters to generate a set of 
predicted values of the number of article changes for 
a low level of government issue divisiveness (one stan- 
dard deviation below the mean level of divisiveness), 
holding all other variables at their means. This allows 
us to approximate the entire distribution of predicted 
values for this level of divisiveness. Then, using the 
same set of simulated parameters, we generate a set of 
predicted values of the number of article changes for a 
high level of government issue divisiveness (one stan- 
dard deviation above the mean level of divisiveness), 
again holding all other variables at their means. This 
gives us a second distribution of predicted values. 

In Figure 1, we present the results of these simu- 
lations for a bill of average length (about eight arti- 
cles). The arrows indicate the median outcome in the 
predicted number of article changes for each level of 
divisiveness. As a comparison of the two curves shows, 
bills that divide coalition partners are predicted to be 
changed substantially more in the legislative process 
than bills that do not divide them. The median number 
of article changes made to a relatively divisive bill is 
expected to be twice as large as the number of changes 
made to a relatively nondivisive bill. Moreover, a con- 
siderable number of nondivisive government bills, over 
20%, are predicted to experience no change at all, com- 
pared with only about 8% of divisive bills. Finally, we 
see that a substantial portion of divisive bills is expected 
to experience a large number of article changes. Com- 
pared to only 4% of nondivisive bills, approximately 
15% of divisive bills are predicted to have at least 
as many article changes as there are actual articles in 
the draft bill. These predictive results obviously lend 
strong support to our central claim that policy divi- 
sions between coalition partners have a considerable 
substantive impact on the extent of changes made to 
bills in the legislative process. 


CONCLUSION 


The study of coalition government in parliamentary 
democracies has been divorced from the study of leg- 
islatures for far too long. Beyond taking account of the 
constitutional requirements that connect legislatures to 
the formation and termination of coalitions, coalition 
scholars have traditionally paid little attention to the 
role of parliaments in policymaking and governance. 
Similarly, comparative legislative scholars, impressed 
by the dominance of cabinet-sponsored legislation in 
parliamentary systems and the cohesive voting be- 
havior of highly disciplined parties, have diagnosed 
a “marginalization of parliament” or “parliamentary 
decline” (see Gamm and Huber 2002). Our argument 
challenges this assessment. We move legislative pol- 
itics to a More prominent position in understanding 
multiparty government by considering how coalition 
partners might use legislative institutions to achieve 
their separate, and often competing, electoral and pol- 
Icy goals. 

In placing greater emphasis on the role of legislatures 
in parliamentary systems, our argument complements 
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Government Issue Divisiveness and Predicted Change In Leg!slation 


* For a bill of average length (approamatety eight artictes), with all other covanates set at thelr mean values. 


other recent approaches that have paid more attention 
to the interplay between parliamentary institutions and 
multiparty governance. The emphasis of these studies 
has primarily been to show that the legislative pro- 
cess bolsters the opportunities of opposition parties to 
wield influence over the direction of public policy. Thus, 
Strøm (1990) and Powell (2000) have highlighted the 
central role of strong committee systems fer strength- 
ening the opposition during the legislative process. In 
contrast, we have focused on the central role of the 
legislative process in strengthening government par- 
ties that have been forced by the requisites of mod- 
ern parliamentary government to delegate important 
agenda-setting powers to their coalition partners. Such 
delegation creates inherent tensions between coalition 
partners that grow particularly strong on issues that 
sharply divide them. In part, coalition governments at- 
tempt to manage these tensions through cabinet-level 
institutions (Müller and Strøm 2000; Thies 2001). A key 
argument of our study is that the legislative process 
provides another important institutional device that 
coalition partners use to counteract the influence of 
ministers who may have incentives to draft bills that 
more closely reflect the preferences of their own con- 
stituents than the agreed-on coalition compromise. 

In testing this argument, we drew on a unique dataset 
that tracks the legislative history of more than 300 gov- 
ernment bills in two parliamentary democracies. These 
data allow, for the first time, a systematic analysis of 
the changes made to a large number of government- 
sponsored bills across a number of different issue areas. 
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Consistent with our argument, this analysis demon- 
strates that ministerial drafts are likely to be changed 
more extensively in the legislative process the greater 
the level of policy divisiveness among parties that par- 
ticipate in the coalition. Perhaps surprisingly, we find 
no evidence that greater policy divergence between 
ministers and the opposition leads to more extensive 
change in government bills, despite the fact that there 
seems to be no shortage of bills that divide them.” 
Taken together, these findings suggest that the process 
of legislative review provides an advantage to parties ` 
that participate in government by allowing them to 
manage the tensions created by the need to govern 
jointly, but it may not provide as many advantages to 
parties relegated to the opposition benches as previ- 
ously believed. 

We make these claims bearing in mind two qual- 
ifications that warrant further investigation in future 
studies. The first is that our analysis has focused on 
situations in which coalition parties collectively control 
a legislative majority. A substantial number of govern- 
ments in parliamentary democracies, however, control 
only a minority of legislative seats. By definition, such 
minority governments must depend on at least one op- 
position party to stay in office. A valuable extension 
of our argument would be to examine whether, and 


25 This fits nicely with our previous finding that greater drvision 
between the opposition and the proposing munister on a bill only 
slightly increases the odds that the bill will expenence delay in the 
legislative process (Martin and Vanberg 2004). 
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to what degree, the preferences of opposition parties 
matter in these situations. The second qualification is 
that we have confined our empirical analysis to “con- 
sensus” or “proportional” systems, which provide sub- 
stantial opportunities for legislative review of govern- 
ment bills. Although these types of systems are pre- 
dominant among parliamentary democracies in which 
coalition government is common (see Powell 2000, 34), 
we recognize that there is variation in the strength 
of legislative institutions, even among consensus sys- 
tems. More significant from our perspective are the 
differences between consensus systems and their “ma- 
joritarian” counterparts. In majoritarian systems, the 
ability of parties to use the legislative process to review 
government bills is greatly diminished. Committees in 
these systems are relatively few in number, they are 
generally nonspecialized (with policy jurisdictions that 
only weakly correspond to those of government de- 
partments), and they are typically restricted in their 
ability to make major changes to government bills. 
As a result, coalition parties will have a more difficult 
time making use of the legislative arena to police their 
partners. We would not be surprised to find that multi- 
party governments in these systems rely more heavily 
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on cabinet-level institutions to resolve intracoalition 
conflicts and/or that these governments more closely 
resemble a pattern of “ministerial autonomy” than the 
pattern of “ministerial correction” uncovered in this 
study. 

In conclusion, our findings are significant because 
they suggest a perspective on legislative institutions 
in parliamentary systems that has received little at- 
tention to date. The same legislative institutions that 
have traditionally been associated primarily with op- 
position influence also play a central role within coali- 
tion governments by enabling parties with divergent 
preferences to pursue compromise policies success- 
fully. Parliaments are important not only as fora in 
which the government and opposition interact. The 
legislative process also provides an institutional mech- 
anism that may be central to the ability of coalition 
governments to solve intracoalition conflicts and to 
contain the potentially detrimental effects of delega- 
tion to ministers. Naturally, our results represent only a 
beginning, and in future work we hope to explore more 
fully the implications of the qualifications noted above 
for the ability of coalition governments to “govern 
jointly.” 


TABLE A1. Coding of Government Bills and Classification by Pollcy Issue and Country 


Percentage of Bills by Issue” 


Pollcy Issue Types of Bills Classified Germany Netherlands Total 


Tax policy 


Income taxes, value-added tax, tax 53.1 50.3 51.5 


allowances, welfare or health services 
benefits, disabled workers’ benefits, famlly 


allowances 


Foreign policy 


Industrial policy 


Relations with the Soviet Union, NATO, or 
Warsaw Pact (Note: No bills after 1989 
were collected on this Issue) 

Industrial production levels, Industrial 


relations, state-owned corporations, 
market (de-)regulation, unlons and 
employer associations, wage pollcy, Job 
training, economic competitiveness 


Social pollcy 


Abortion, homosexuality, alternative 


Ifestyles, domestic cohabitation, 


pomography, “morality” Issues 
State Intervention into religious affalrs 


Quota levels and pnce regulation of 


Clencal policy 
Agricultural policy 


agncultural goods, farm subsidies 


Regional policy 


Centralization or decentrallzation, alterations 


to municipal or reglonal laws, redistricting 
of communal boundaries, regional reforms 


Environmental pollcy 


Alr, soll, or water pollution, regulation of 


emlssions standards, CFCs, ecological 


reservation 
* Number of bills (336): Germany (147), Netherlands (189) 
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Who Influences U.S. Foreign Policy? 


LAWRENCE R. JACOBS University of Minnesota 
BENJAMIN I. PAGE = Northwestern University 


foreign policy, including experts and “epistemic communities,” organized interests (especially 

business and labor), and ordinary citizens or “public opinion.” This research, however, has often 
focused on a single factor at a time, rather than systematically testing the relative importance of alternative 
possible influences. Using extensive survey data gathered over three decades we conduct a comparative 
test, attempting to account for the expressed foreign policy preferences of policy makers by means of the 
preferences of the general public and those of several distinct sets of elites. The results of cross-sectional 
and time-lagged analyses suggest that U.S. foreign policy is most heavily and consistently influenced 
by internationally oriented business leaders, followed by experts (who, however, may themselves be 
influenced by business). Labor appears to have significant but smaller impacts. The general public seems 
to have considerably less effect, except under particular conditions. These results generally hold over 
several different analytical models (including two-observation time series) and different clusters of issues 
(economic, military, and diplomatic), with some variations across different institutional settings (the U.S. 


R esearch in international relations has identified a variety of actors who appear to influence U.S. 


House, Senate, and executive branch). 


benefits for their members drive American for- 

eign policy? Or is policy more influenced by the 
views of epistemic communities that use their expertise 
to identify the national interest? Alternatively, do the 
collective preferences of ordinary citizens shape U.S. 
foreign policy? Do the wielders of influence vary by is- 
sue domain, the level of issue salience, or the particular 
government institutions involved in policy making? 

Such questions touch on the classical “Who gov- 
erns?” query that has animated much research in 
American and comparative politics (Dahl 1961). But 
they are even more central to debates in international 
relations. They relate to the democratic character of 
American conduct internationally, the extent to which 
calculations of national interest do or do not drive 
U.S. foreign policy, and the general nature of domes- 
tic sources of foreign policy. To answer fundamental 
empirical questions about who influences government 
officials may help pave the way for developing broader 
theory regarding the determinants of international be- 
havior. 

These questions also have bearing on important nor- 
mative issues. Democratic theories emphasizing pop- 
ular sovereignty and electoral accountability argue 
that government policy ought to reflect the views of 


D O organized interest groups seeking narrow 
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ordinary citizens (e.g., Dahl 1989). Adherents of these 
relatively “populistic” democratic theories would pre- 
sumably applaud findings of strong public influence 
on U.S. foreign policy. On the other hand, more elite- 
oriented theories of democracy (Burke 1949; Sartori 
1987; Schumpeter 1976), as well as realist theories of 
international relations (Kennan 1951; Lippmann 1955; 
Morgenthau 1973) would sound an alarm. They ques- 
tion the competence of citizens and assert that the qual- 
ity of foreign policy is likely to suffer if the mass public 
is allowed to have much direct impact. Evidence that 
experts and members of “epistemic communities” sig- 
nificantly affect U.S. foreign policy would be applauded 
by some as holding out the best hope for objectively 
identifying and advancing the national interest in an 
increasingly complex global environment (Adler and 
Haas 1992; Haas 1992; Hall 1989). Alternatively, find- 
ings that organized interest groups drive U.S. foreign 
policy would raise troubling questions about a possible 
“bias in representation” toward particularistic groups 
(Moravesik 1997, 530). 


COMPETING VIEWS OF WHO INFLUENCES 
U.S. FOREIGN POLICY 


Three prominent, empirically based views of who in- 
fluences U.S. foreign policy suggest sharply different 
predictions. 


Neoliberalism and Organized Groups 


Many scholars who take a neoliberal approach to in- 
ternational politics emphasize the decisive influence 
of organized interest groups on foreign policy (e.g., 
Keohane 1984). In this view, executive and legislative 
officials with foreign policy authority bargain with do- 
mestic groups that use their members’ votes, campaign 
contributions, threatened or actual capital flight, labor 
strikes, and other tools to affect the electoral benefits 
and costs to elected officials of choosing alternative 
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policies (e.g., Frieden 1991; Gourevitch 1986; Rogowski 
1989). For instance, Keohane and Milner (1996) trace 
targeted government subsidies and trade protections 
to the influence of well-organized and financed groups; 
Snyder (1991) attributes defense policy to logrolling 
coalitions. 

Organized labor and, perhaps even more so, business 
corporations possess critical resources for pressuring 
policy makers. Given their mission to protect the jobs 
and benefits of its members, “[labor] leaders have spo- 
ken out often on foreign affairs” (Galenson 1986, 62). 
In addition to addressing foreign policies that affect 
its bread and butter interests at home, organized labor 
in the United States has historically also spoken out 
on such matters as anticommunism, the Vietnam war, 
defense spending, and U.S.—Soviet relations, perhaps 
changing stands as the Cold War ended and the AFL- 
CIO’s blue-collar base was challenged by an influx of 
civil service and other unions representing white-collar, 
professional, and service occupations (Lipset 1986). 
Despite the substantial political resources of U.S. or- 
ganized labor, however, including its financial contri- 
butions, volunteers, and voters in many congressional 
districts, there is some disagreement about whether it 
exerts significant policy influence (Gottschalk 2001) or 
not (Esping-Anderson 1990). 

Some neoliberal analysts of international politics 
have singled out business corporations and business 
associations as exerting particular influence on Ameri- 
can foreign policy because of their effects on the econ- 
omy and their capacity to prompt voters to punish the 
incumbent political party (Milner 1997). A number of 
studies have reported influence by business on spe- 
cific types of foreign policy: Rogowski (1989) traces 
economic policies to powerful domestic economic in- 
terests; Trubowitz (1998) points to uneven economic 
growth and struggles for regional economic advantage 
to explain a variety of foreign policies; and Grossman 
and Helpman (1994, 1995) link industry lobbying and 
campaign contributions to international trade relations 
and, specifically, increased tariffs for politically orga- 
nized industries. Some argue that pressures on gov- 
ernments to tailor foreign policy to please business 
have increased over the past three decades with the 
emergence of an open world economy characterized 
by rapid international movement of capital and greater 
exposure to global economic competition (Bates and 
Lien 1985; Winters 1996). 

This line of research suggests that different policy- 
making institutions may vary in their susceptibility to 
organized pressures. Executive branch officials, who 
play a central part in national security and foreign pol- 
icy, have been said to focus on identifying collective 
gains in pursuing the “national interest” and there- 
fore to be somewhat resistant to organized pressure 
(Art 1973; Krasner 1972, 1978; Wildavsky 1991). Or- 
ganized groups may be particularly influential with 
Congress, where senators and, especially, Represen- 
tatives (elected in relatively small districts) may be 
acutely responsive to demands for concentrated ben- 
efits from narrowly based groups of constituents and 
campaign donors (Milner 1997). 
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In short, interest group—oriented scholars suggest 
that labor and, especially, business should exert a strong 
influence on U.S. foreign policy. 


Eplstemic Communkties and 
Knowledge-Based Experts 


Research on “epistemic communities” indicates that 
the growing complexity and uncertainty of global prob- 
lems has “led policy makers to turn to new and differ- 
ent channels of advice” and specifically to new “net- 
works of knowledge-based experts” in the academy, 
think tanks, and other repositories of technocratic ex- 
pertise in order to articulate the objective causes of 
international problems, the “real” stakes or interests 
of states affected by those problems, and appropri- 
ate policy remedies (Haas, 1992, 12; cf. Hall 1989 and 
Nelkin 1979). In the introduction to an influential spe- 
cial volume of International Organization, Peter Haas 
(1992) argued that the “epistemic community mem- 
bers’ professional training, prestige, and reputation for 
expertise... accord them access to the political system 
and...influence over policy debates” through agenda 
setting and the formulation of policy alternatives in the 
executive and legislative branches (2-3, 17; Adler and 
Haas 1992). 

Research on epistemic communities has two impli- 
cations that are important for us. First, it suggests that 
experts equip government officials to conduct analyses 
and reach decisions that can be independent of direct 
pressures from organized groups or citizens. The im- 
plication, then, is that business and labor may exert 
relatively little direct influence on the foreign policy 
decisions of government officials. 

Second, epistemic communities may serve as con- 
crete mechanisms for identifying and addressing the 
objective interests of states that may ultimately be de- 
fined, as classical and structural realists emphasize, by 
interstate competition and the structure of the inter- 
national system (Walt 1987; Waltz 1959, 1979; but cf 
Rose 1998). Students of epistemic communities argue 
that realists incorrectly “assume that a state’s interests 
are clear and that the ways in which its interests may 
be most efficaciously pursued are equally clear” (Adler 
and Haas 1992, 367-69; Haas 1992, 13-14). Instead, 
they maintain, technical experts are the vehicle for the 
interpretation of international structures, the identi- 
fication of the “imperatives” facing the state, and the 
articulation of state interests in international politics. In 
short, research on epistemic communities suggests that 
conditions of uncertainty produce strong incentives for 
government officials charged with making foreign pol- 
icy to respond to experts from think tanks, the academy, 
and other reservoirs of highly trained specialists and 
professionals. 


Medlan Voter Theory and the Influence 
of Public Opinion 


Median voter theory predicts that actual or anticipated 
electoral competition motivates holders of government 
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office to minimize the distance between their policy 
stands and the policy preferences of voters, thus re- 
sponding strongly to the public’s preferences. (In the 
case of unidimensional, two-party competition, parties 
are predicted to converge exactly at the position of 
the median voter [Downs 1957].) Empirical evidence 
of public influence on foreign policy has been reported 
in a large and growing body of research by students of 
international relations and foreign policy (e.g., Hartley 
and Russett 1992; Holsti 1996; Nincic 1990; Russett 
1990; Sobel 1993, 2001; Wittkopf 1990). Effects of the 
public have also been found in quantitative analyses 
by students of public opinion, who have reported, for 
example, that 62% of U.S. foreign policies changed in 
the same direction in which public opinion had previ- 
ously changed (Page and Shapiro 1983, 182) and that 
congressional district-level public support for military 
spending was related to Congress members’ votes on 
military spending bills during the presidency of Ronald 
Reagan (Bartels 1991; see also Monroe 1979, 1998). 
This and other evidence of influence by the public is 
now recognized in broader debates about international 
relations (e.g., Putnam 1988, 432, 436) and national 
security policy, which Miroslav Nincic (1990) found to 
be “tethered to domestic electoral calculations,” espe- 
cially when elections approach (395). 

The idea of government responsiveness to public 
opinion has also informed research on the “demo- 
cratic peace,” which has found a tendency for individual 
democratic states and, especially, pairs of democratic 
states to be more pacific on average than nondemo- 
cratic states (Russett and Oneal 2001; cf. Elman, 1997, 
10-20). One line of thinking in democratic peace re- 
search argues that competitive elections “makes demo- 
cratic leaders...sensitive to public opinion” because 
politicians either anticipate electoral punishment or 
are thrown out of office for being unresponsive: “Citi- 
zens in a democratic state can influence governmental 
policy directly, through public opinion, or indirectly, 
though their representatives.” 

Salience may be an important condition affecting 
the extent of public influence. E. E. Schattschneider’s 
(1960) analysis of the “scope of conflict” indicates that 
the general public should have its greatest impact on 
highly salient issues that draw intense attention from 
the media and voters and thereby pose the most di- 
rect threat of electoral punishment for government 
officials who are unresponsive. In contrast, narrow, 
well-organized interests may dominate on less visible 
issues. Some empirical evidence seems to support the 
prediction of greater public influence when salience is 
high (e.g., Page and Shapiro 1983, 181). 

There are also indications that the characteristics of 
different government institutions may produce differ- 
ent degrees of influence by the public. For example, 


1 See Peterson 1995, 10-11, and Russett and Oneal 2001, 274, also cf. 
Russett 1996, 100. Alternatively, the democratic peace has been at- 
tributed to liberal norms that promote nonviolent resolution of con- 
flict, international law and shared membership in intergovernmental 
Organizations, and economic interdependence that puts a premium 
on stable, ongoing commercial relations (Doyle 1983, Elman 1997; 
Owen 1994; Russett and Oneal 2001). 
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members of the House of Representatives, which the 
Federalist Papers labeled the “people’s House” due 
to their frequent election in small and decentralized 
districts, were expected to be especially sensitive to 
public opinion, while officials in the executive branch 
and Senate were expected to be less responsive due 
to their insulation from the public by longer terms in 
office and originally indirect elections. 

An ample body of qualitative and quantitative re- 
search, then, indicates that U.S. foreign policy and the 
policy preferences of government officials are substan- 
tially influenced by public opinion. Apparently accept- 
ing such influence as an empirical fact, a long line of 
observers including classical realists has urged policy 
makers not to respond to the preferences of citizens, 
because of concerns that the general public engages in 
“simple moralistic and legalistic” thinking, is detached 
from the reality of international politics, exhibits unsta- 
ble shifting “moods,” and hungers for “quick results” 
(Morgenthau 1973, pp. 135, 146-48; cf. Almond 1950 
and Kennan 1951). Walter Lippmann (1955) warned 
that following public opinion would create a “morbid 
derangement of the true functions of power” and pro- 
duce policies “deadly to the very survival of the state 
as a free society” (15, 20, 26-27). 


Problems with Past Research 


Previous research concerning the impact of organized 
groups, epistemic communities, and public opinion on 
U.S. foreign policy has produced an impressive body of 
results that point in diverse directions regarding who 
influences government officials. It has not, however, 
definitely sorted out the relative impact of different 
factors on U.S. foreign policy. Even excellent case stud- 
ies that disentangle causal mechanisms and trace pro- 
cesses of policy making usually leave open the issue of 
how well they generalize beyond those particular cases. 
Perhaps even more importantly, lack of definitiveness 
has also resulted from two problems related to the 
scholarly division of labor: omitted variables and lack 
of comparative testing. Understandably, each of the 
three main approaches we have reviewed has tended 
to focus on a set of variables of particular interest to it, 
rarely investigating and testing competing explanations 
at the same time. Most studies of public opinion and 
government policy, including foreign affairs (some of 
which were conducted by the present authors [Jacobs 
and Shapiro 1994; Page and Shapiro 1983; but cf. 
Jacobs 1992, 1993, 231-33, and Page 2002]), have failed 
to include any independent variables other than public 
opinion. However understandable this research strat- 
egy may be, it runs the risk that other important influ- 
ences may be neglected. It may lead each approach to 
overestimate the importance of its own favorite factors 
and to offer little or no estimate of the relative impact 
of different possible influences. 

What is now needed, we believe, is comparative ana- 
lysis, based on a large number of diverse cases, of the 
relative influence on U.S. foreign policy of several key 
actors, including organized interest proups (especially 
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business and labor), epistemic communities from think 
tanks and the academy, and mass public opinion. The 
present paper attempts to take a first step in that direc- 
tion. 


DATA AND METHODS 


We have analyzed a set of data that are uniquely well 
suited to this purpose, drawn from eight quadrennial 
pairs of surveys that that were sponsored from 1974 
through 2002 by the Chicago Council on Foreign Re- 
lations (CCFR) and implemented by the Gallup and 
Harris organizations.” These surveys, covering a large 
and diverse set of foreign policy issues, elicited the 
policy preferences of the general public and also the 
preferences of several distinct sets of “foreign policy 
leaders,” including policy makers (government officials 
in the executive branch, the House of Representatives, 
and the Senate), members of critical interest groups 
(especially business and labor), and members of epis- 
temic communities (educators and leaders of private 
foreign policy organizations or think tanks).° 

These parallel surveys of the mass public and for- 
eign policy leaders have both strengths and limitations. 
The government officials and other elites were not ran- 
domly selected for interviews; they were chosen from 
institutional positions involving foreign policy respon- 
sibilities or expertise.* Although a total of 2916 elites 
were interviewed in the eight leadership surveys, the 
number of each specific type of elite interviewed in 
any single year was not very large.’ (The surveys of the 
general public were based on random, relatively large 
samples of about 1550 respondents each.)° 


2 The 2002 study was cosponsored by the German Marshall Fund 
of the United States. Gallup conducted the surveys in 1978, 1982, 
1986, 1990, 1994, and 1998, the Harris organization conducted them 
in 1974, and Harris Interactive did so in 2002. See Rielly (1975 
et seq ) 

3 Of less interest to the present analysis, the leadership surveys also 
included respondents from the media, religious leaders, special in- 
terest groups relevant to foreign policy, and (in 1974) leaders of 
minority groups. As discussed below, we created a single category of 
“experts” by combining “educators” (1 e, faculty who teach in the 
area of foreign affairs and presidents and chancellors of major uni- 
versities), “special foreign policy organizations” (Le., think tanks), 
and “private foreign policy groups” (1.¢., leaders of major foreign 
policy organrzations), after analyzing each group separately 

4 Senators and Representatives, for example, were chosen (at least 
through 1990) from the membership of committees and subcommit- 
tees related to foreign policy. Administration officials came from 
the Department of State and from internationally onented units of 
the commerce, treasury, agriculture, and other departments, though 
rarely from the Department of Defense or the National Security 
Council. Business respondents were sampled mainly from corporate 
vice presidents for international affarrs, and labor respondents from 
high-level union officials oriented toward foreign affairs 

5 The average number interviewed each year was about 76 for gov- 
ernment officials, 58 for business, 28 for labor, and 79 for experts 
Although the categories of respondents were generally quite stable 
across surveys, the 1974 survey combined officials from the House, 
Senate, and administration, added mmority groups, and did not sur- 
vey think tank members For 1978 through 2002, the average numbers 
of policy makers interviewed were 19 from the Senate, 36 from the 
House, and 23 from the administration. 

6 In 2002, 2,862 respondents were interviewed by telephone and 400 
were interviewed in person This made it possible to assess compara- 
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Despite their limitations, the CCFR surveys provide 
what are, so far as we know, the best available data 
on comparably measured foreign policy preferences of 
ordinary citizens, interest groups, epistemic communi- 
ties, and U.S. government officials. They cover a wide 
range of foreign policy issues over a lengthy period 
of time, both during and after the Cold War. Data on 
large numbers of key policy makers are very difficult 
to obtain (but cf. Holsti and Rosenau 1984), especially 
from samples that are comparable over multiple years. 
The CCFR elite samples have the advantage of being 
drawn in a consistent manner across years, because of 
continuity in survey organizations and research teams 
as well as conscious efforts to produce comparable 
data. In addition, the private and confidential nature 
of the interviews with respected survey organizations 
probably helped to discourage public posturing and 
encourage relatively candid expression of views. 

A crucial advantage of these survey data is that they 
permit us to analyze relationships using precise, di- 
rectly comparable measures of the policy preferences 
of policy makers and those of the public, members of 
interest groups, and experts. These measures are based 
on responses to identical questions asked of the various 
groups at the same time. Previous researchers have 
generally lacked such comparable measures and have 
had to struggle with the question of exactly how close 
a given policy did or did not come to the wishes of 
particular actors. 

Some of the interpretations we offer for our analyses 
rest on a key assumption: that the survey-expressed 
policy preferences of government officials can be used 
as reasonable indicators of the policies that they enact 
or pursue. We recognize, of course, that there may well 
be some slippage between the “private” preferences 
of individuals and actual foreign policy. We certainly 
do not suppose that the CCFR data on policy makers’ 
expressed preferences invariably and without excep- 
tion have corresponded with what the government has 
actually done. There are reasons to doubt, however, 
that the dissonance between what officials do and what 
they say they favor doing would often be allowed to 
become extreme. We have carefully scrutinized these 
data and have found that policy makers’ aggregate sur- 
vey responses have usually reflected the positions and 
actions of the institutions in which they held office. 
Given the difficulty of obtaining comparable measures 
for analyzing influences on foreign policy, we conclude 
that, on balance, the limitations of these data are out- 
weighed by the enormous advantage of being able to 
obtain precise, comparable, quantitative measures of 
the positions of government officials, organized groups, 
experts, and the public.’ 


bility with the previous m-person surveys. We use the combined tele- 
phone and in-person data set. Interviews were typically conducted 
in the fall but in 2002 were carried out ın June. 

7 Holsti and Rosenau (1984) report an outstanding study of a wide 
range of U.S decision makers and foreign policy leaders, includ- 
ing high-ranking military officers (unfortunately excluded from the 
Chicago Council surveys) 

8 Even if the correspondence between actual foreign policy and de- 
cision makers’ expressed preferences is imperfect, ascertaining the 


American Political Science Review 


Our dependent variables are measured in a simple 
fashion: the percentage of policy makers (that is, the 
percentage of all policy makers or of a subset of pol- 
icy makers from the administration, the House, or the 
Senate) who favored or opposed a particular policy 
alternative in a given survey.’ We believe that these 
percentage measures generally reflect the position of 
the average policy maker on an underlying policy con- 
tinuum. The percentage favoring a particular type of 
foreign aid, for example, may reflect the amount of aid 
that the average respondent favors.!° 

Our independent variables are measured in the same 
way: percentages of the general public, or of busi- 
ness people, labor leaders, or experts, who favored 
or opposed the same policy alternative that the pol- 
icy makers were asked about. Because “Don’t know” 
and “Refuse to answer” responses are generally more 
common among the general public than among pol- 
icy makers, we recomputed percentages without them. 
This gave us comparable measures of the views of those 
with opinions among elites and the mass public. 

The scope and duration of the Chicago Council’s 
parallel studies of elites and the general public enabled 
us to analyze variations across different institutions 
(comparing subsets of policy makers from the admin- 
istration, the House, and the Senate) and variations 
in levels of issue salience to the public. In addition, 
we separately examined three broad, exhaustive, and 
mutually exclusive policy domains: diplomatic policy 
(e.g, relations with other countries and international 
organizations), defense policy (including the recruit- 
ment and deployment of troops, military aid, and the 


determinants of policy makers’ preferences should still be of interest 
so long as those preferences have any substantial impact at all on 
policy 

Another possible concern in using these survey data is that some 

elite respondents undoubtedly delegated answering the CCFR ques- 
tions to staff members We believe that subordinates’ responses are 
generally likely to reflect the views of the superiors who hire, pro- 
mote, and supervise them. 
? All items were dichotomized, using a quasi-random dichotomiz- 
ing scheme to group responses when more than two alternatives 
were Offered. We also alternated the polarity of the responses that 
were tabulated—i e., whether we tabulated the percentage m “favor,” 
the percentage “oppose[d],” or the percentage favoring one extreme 
combined with those favoring one or more neutral or middle alter- 
natives The percentage of relevant respondents making the selected 
response or combination of responses was then recorded as the value 
of the dependent variable on that issue for that given year. 

The quasi-random dichotomuzaton and alternation of polanties 

mean that the direction of preferences on different issues does not 
have an intuitive meaning in common. It 1s not always the case, 
for example, that higher percentages signal more “liberal” or more 
“Internationalist” responses But this procedure was necessary m 
order to ensure substantial variance and to avoid certain statistical 
biases in the analyses. Dichotomization was essential to ensure that 
the percentage support measure was uniquely defined. 
10 The precise relationship between (for example) the percentage of 
respondents favoring an “increase” in foreign aid and the amount 
of aid increase favored by the average respondent is likely to be 
complex and related to aspects of survey responses and underlying 
preferences about which we have little informaton Our general 
point is that the percentage of policy makers “favoring” alternative X 
1s likely to track the amount of X favored by the average respondent, 
which is more directly applicable to actual policy and of more central 
interest. 
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development, procurement, and transfer to other coun- 
tries of military hardware), and economic policy (e.g., 
issues related to trade, tariffs, and the protection and 
promotion of American jobs and businesses). Diplo- 
matic and defense policy questions were asked most 
frequently (214 and 209 items, respectively, asked of 
both the public and elites), followed by economic policy 
(144 items). 

Since our aim is to sort out the independent impact 
of each factor and to compare them with each other, 
we relied primarily on multivariate regression analyses. 
The meaningful and intuitively understandable units of 
measurement involved (percentage points on the fa- 
miliar 0 to 100 scale) led us to focus on unstandardized 
ordinary least squares (OLS) regression coefficients, 
which can tell us how many percentage points of change 
in policy makers’ support for a policy are typically as- 
sociated (controlling for all other factors) with a one- 
percentage-point increase in support by (for instance) 
business respondents. A coefficient near zero would 
signal no influence at all on policy makers by business, 
whereas a coefficient near 1.0 would signal very great 
influence. We conducted these regression analyses for 
all years combined (the most stable and reliable set 
of estimates) and for each separate year, for all policy 
makers combined and for each separate institutional 
subgroup of policy makers, for all issues together and 
for each of the three issue categories separately, and for 
issues grouped according to varying degrees of salience. 

We employed four types of regression models involv- 
ing distinct analytic approaches. First, in Model 1 we 
conducted purely cross-sectional analyses, regressing 
the preferences of policy makers at a given time on 
those of business people, experts, labor leaders, and or- 
dinary citizens at that same time. These cross-sectional 
analyses generally used all eight pairs of surveys to- 
gether, which included a total of 567 in-common items 
ascertaining preferences about foreign policy: that is, 
567 questions about policy preferences that were asked 
in a given year, with identical wording, of both citizens 
and elites‘! The number of items varied from a high 
of 112 in 2002 to a low of 48 in 1986. For each of the 
567 issue cases we constructed percentage measures 
of the aggregate policy preferences of policy makers, 
the general public, business people, labor leaders, and 
foreign policy experts (as well as other groups) and 
analyzed predictors of the variation in policy makers’ 
preferences across issue cases. 

Model 1-style cross-sectional analyses have some 
usefulness. They may be particularly helpful for cap- 
turing quick or near-instantaneous effects: If, for ex- 
ample, prominent experts or business leaders were to 
gain rapid access to policy makers and influence them at 
once, with the effects fading as new circumstances arise. 
But they are less apt to identify slow-acting or delayed 


11 Questions asked in more than one year are treated as separate 


cases The public was asked a number of additional questions not 
analyzed here. We excluded questions not related to preferences 
about future government policy, such as those concerning past per- 
formance, US. “vital interests,” or “feeling thermometer” ratings of 
countries. 
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effects by other actors. Moreover, cross-sectional anal- 
yses—absent simultaneous equation techniques’*—are 
notoriously subject to causal ambiguity. And they can- 
not deal with the possibility that past government de- 
cisions may structure or “lock in” the positions of cur- 
rent officials, as in the incrementalism of government 
budget making (Wildavsky 1975). We need a regression 
model that incorporates any “inertial” forces in foreign 
affairs and, to the extent possible, controls for them. 

Our second type of regression model (Model 2) 
keeps the basic cross-sectional design for independent 
variables but addresses the potential for incremental, 
self-reinforcing dynamics by lagging the dependent 
variable and studying the effects of foreign policy deci- 
sion makers’ previous preferences. For this purpose we 
need data on the history of our dependent variables. 
Fortunately, the CCFR surveys allow us to identify 
many “two-time period” cases—that is, pairs of identi- 
cal survey items that were asked of both the mass public 
and elites in two sequential surveys. We identified 252 
sequential pairs, which permit us to conduct regres- 
sion analyses that lag the dependent variable for one 
period—using responses to the same policy preference 
question when it was asked in the survey four years ear- 
lier. In such regressions, policy makers’ 2002 support 
for economic foreign aid (for example) is predicted by 
the policy makers’ 1998 views on aid, together with the 
2002 preferences of the mass public, business and labor 
leaders, and experts. 

Model 2 analyses allow us to examine the impact 
on policy makers of contemporaneous preferences of 
the general public, business and labor leaders, and ex- 
perts, while taking account of the effects of the past 
policy preferences of government officials. But there is 
danger that using the lagged dependent variable may 
introduce excessive controls for prior influences, par- 
ticularly quick-acting influences that may already have 
been incorporated in the prior value of the dependent 
variable and are then “controlled” away. In addition, 
Model 2 (like Model 1) remains prey to causal ambi- 


guity. 

Statistically disentangling whether public opinion 
and nongovernmental elites affect policy makers, or 
whether the reverse happens—policy makers influence 
the preferences of others—is a daunting challenge. One 
promising approach is to track temporal sequences 
(i.e., to see whether changes in hypothesized indepen- 
dent variables actually precede changes in the depen- 
dent variable) and to employ the logic of “Granger 
causality” (i.e., to see whether the history of hypothe- 
sized independent variables actually adds explanatory 
power to that of the history of dependent variables; see 
Freeman 1983). 

Our third type of regression model (Model 3) uses 
the data from sequential pairs of survey questions 
to conduct two-observation time series analyses, with 
lagged values for independent as well as dependent 
variables. Although the restriction to two time points 


12 We have not been able to specify, with sufficient theoretical con- 
fidence, exogenous variables for which it is feasible to obtain good 
measures over the diverse issue cases and identify reciprocal effects 
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prevents us from using the full power of time series 
techniques, we can at least begin to get at the logic 
of causal time asymmetries and Granger causality. For 
example, in such a regression the 2002 preferences of 
policy makers concerning foreign economic aid are 
predicted by the 1998 preferences concerning aid of 
the public, experts, labor and business leaders, and pol- 
icy makers themselves. Since causes generally precede 
rather than follow effects, this time asymmetry adds to 
our confidence about causal inference. Incorporating 
the most recent history of the independent and depen- 
dent variables begins to get at Granger causal logic. 

The Model 3, two-observation time series analyses 
take good advantage of the CCFR data but are not 
without limitations. First, because of the four-year in- 
tervals between surveys, the time lags are relatively 
long. They may not allow us to distinguish the different 
speeds and durability with which different groups may 
exercise influence. For instance, major, internationally 
oriented business leaders or prominent experts might 
be able to exercise influence especially quickly because 
they are highly attuned to policy impacts and have 
privileged access to policy makers. By contrast, labor 
leaders might tend to have only delayed—but possi- 
bly long-lasting—influence due to reliance on building 
up pressure in congressional districts. Second, we lack 
a full account of the history of the independent and 
dependent variables and therefore cannot explore dif- 
ferent lags or conduct full Granger tests. We can only 
lag our variables one period. Finally, the need for iden- 
tical questions in two sequential surveys decreases the 
number of cases available for analysis, which reduces 
the precision of estimates and makes it more difficult 
for results to attain statistical significance. 

The fourth type of regression model (Model 4) pur- 
sues the logic of Model 3 one step further: It uses 
change measures for dependent variables and change 
measures for the independent variables. In particular, 
Model 4 examines whether changes in the views of 
business, experts, labor, and members of the mass pub- 
lic predict changes in the preferences of policy makers. 
For example, the changes between 1998 and 2002 in 
policy makers’ preferences regarding economic aid are 
predicted by the 1998 to 2002 changes in the aid pref- 
erences of the public, experts, and labor and business 
leaders. The strength of this model is that it is explicitly 
dynamic. Model 4 controls automatically for all possi- 
ble influences on the immediately prior values of de- 
pendent and independent variables. It also promises to 
give leverage on causal ordering; evidence that changes 
in the independent and dependent variables covary 
strengthens the basis for inferring causation. On the 
other hand, the necessary focus on changes in both 
dependent and independent variables over the same 
time period may reintroduce problems with recipro- 
cal causation: We cannot generally tell which change 
actually occurred first. 

Each of these four models offers a distinct approach 
to disentangling complex causal connections, based on 
the available opportunities and challenges of an un- 
usual data set. Rather than select one or two models 
for exclusive attention (Models 3 and 4, say), we have 
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pursued a strategy of methodological pluralism. We 
are particularly interested in exploring patterns across 
all four models. A convergence of estimates from four 
distinct models would strengthen our confidence in the 
results and would be very gratifying. On the other hand, 
differences among them might help establish bound- 
aries within which the true magnitudes are likely to 
lie. 


ANALYZING INFLUENCES ON FOREIGN 
POLICY MAKERS 


Before turning to the multiple regression models we 
examined simple bivariate relationships at one point 
in time between the aggregate preferences of all policy 
makers taken together and those of the mass public 
as well as each of the seven distinct clusters of “for- 
eign policy leaders” that the Chicago Council surveys 
repeatedly interviewed (business, labor, educators, pri- 
vate foreign policy organizations or think tanks, editors 
and journalists from the media, special interest groups 
relevant to foreign policy, and religious officials). All 
these bivariate relationships were positive, quite large, 
and highly significant at the p < .01 level. The percent- 
age of policy makers preferring a specific policy was 
most strongly correlated with the percentage prefer- 
ring the same policy among respondents from the me- 
dia (r = .94), business (r = .91), foreign policy organiza- 
tions and think tanks (r = .90), and educators (r= .90). 
Religious leaders (r=.85) and labor leaders (r= 
.84) came not far behind, with the general public 
(7 =.77) taking up the rear. The current preferences of 
policy makers were almost as strongly correlated (and 
highly significantly so) with the preferences of these 
same groups measured in the previous time period. 

The results for business are consistent with previous 
research on the role of organized interest groups in 
U.S. foreign policy, and the findings for think tanks 
and educators are in line with the analysis of epistemic 
communities. The relatively low figures for the public 
are somewhat surprising, however, given previous find- 
ings that policy makers are highly responsive to public 
opinion. 

There are reasons to doubt that the high bivariate 
correlations for religious leaders and the media really 
reflect major, direct impacts on the making of US. 
foreign policy. Few scholars have asserted that such 
impacts occur, and in theoretical terms, any such influ- 
ence would presumably be channeled mostly through 
the public. It seems possible that the high correlations 
were spurious or resulted from reciprocal relationships. 
The media, for example, have not often been identified 
as a direct influence on policy making, but considerable 
research does suggest the opposite causal connection— 
Le., influence by government officials on media that 
rely heavily on officials as news sources (e.g., Bennett 
1990, 1994; Entman 2004; Hallin 1986; Nacos, Shapiro, 
and Isernia 2000, part I; Sigal 1973). 

Not only did the preferences of all the elite groups 
surveyed by the Chicago Council correlate highly with 
the preferences of policy makers; but also they corre- 
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lated very highly with each other. Contemporaneous 
correlations tended to fall in the r=.90 to .92 range, 
and the correlation between educators’ and the media’s 
preferences reached the remarkable level of .96. Cor- 
relations with preferences at the previous time point 
were also quite high, though generally about a tenth 
of a point weaker. (The reduction in the predictive 
power judged by the Rê statistic is more substantial.) 
Substantively, this suggests the existence of something 
like a “foreign policy establishment,” in which policy 
preferences are largely shared across several different 
categories of elites engaged in foreign policy, while the 
general public stands somewhat to the side. (Contem- 
poraneous correlations between the preferences of the 
public and those of elite groups ranged from .83 [for 
labor] and .82 [for religious leaders] down to .68 for 
private foreign policy organizations and think tanks. 
Correlations with responses from the previous time 
period were similar, though a bit lower.) 

Methodologically, the high intercorrelations among 
elite groups augur possible trouble with multicollinear- 
ity. With such data, sorting out distinct effects in a pre- 
cise and reliable fashion is a difficult challenge. We now 
turn to our four multivariate models in order to attempt 
to estimate the independent impacts of the public and 
various elite groups on government officials. 

Our first step in cross-sectional multivariate analysis 
(Model 1) was to estimate what could uncharitably be 
called a “garbage can” model, with policy makers’ for- 
eign policy preferences as the dependent variable and 
with the contemporaneous preferences of the general 
public and of each of the CCFR’s seven distinct clus- 
ters of elites as independent variables. This regression 
produced results already hinted at by the bivariate cor- 
relations: rather substantial and highly significant coef- 
ficients for business (b= .31) and think tanks (b=.22), 
but a second-tier status for labor (b= .09; significant 
at only p <.05) and, especially, the public, which had 
a near-zero coefficient that was not statistically signifi- 
cant at even the .10 level. (Coefficients are significant 
at p < .01 unless otherwise indicated; n = 567.) 

The same “garbage can” approach using our other 
regression models generally seemed to confirm strong 
effects by business. Model 2, cross-sectional analysis, 
except for a lagged dependent variable (policy makers’ 
preferences from the previous survey), showed large 
and highly significant coefficients for business (b= .29), 
with the effects of labor and think tanks only about half 
as strong and the coefficient for the public failing to 
reach statistical significance. (The estimated effect of 
the lagged dependent variable was statistically signif- 
icant but only moderate; b= .18). Model 3, regressing 
the current preferences of policy makers on the four- 
year-previous preferences of the public, elite groups, 
and policy makers, seemed to confirm the influence of 
business (b = .27) and added new evidence of labor’s 
effects (b = .36)—perhaps indicating a delayed impact 
over time. The lagged dependent variable remained 
significant and became substantial (b = .35), but the 
coefficients for the public, educators, think tanks, the 
media, and religious groups did not reach statistical 
significance. 
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These “garbage can” analyses, however, ran afoul 
of two serious methodological problems. First, the co- 
hesiveness of the “foreign policy establishment” did 
indeed produce extremely high levels of multicollinear- 
ity, with VIF coefficients of more than 20 for the media 
and 15 for educators in the cross sectional analysis (a 
VIF coefficient above 10 is generally considered trou- 
bling; see Chatterjee and Price 1991). Second, the theo- 
retical rationale for the causal structure of these analy- 
ses is highly questionable. For example. they treated the 
preferences of media figures as a purely independent 
variable that directly influences government officials, 
even though previous research (as noted above) sug- 
gests the opposite causal connection. Direct impact by 
religious leaders also seems questionable. 

We refined our subsequent analyses to include only 
a theoretically solid core of independent variables. We 
dropped from analysis the media variable (which suf- 
fered from high multicollinearity and causal ambigu- 
ity), religious leaders (also causally ambiguous), and 
“special interest groups” (poorly defined and showing 
little estimated impact). Further, we combined edu- 
cators with respondents from private foreign policy 
organizations and think tanks, whose preferences were 
highly correlated with each other (r = .92,p < .01) and 
played essentially the same roles in regressions when 
entered separately. The result was a single variable for 
the policy preferences of “experts,” which is consistent 
with research on epistemic communities. 

Subsequent regression analyses using this refined, 
parsimonious set of independent variables yielded re- 
sults that were considerably more satisfying, both sub- 
stantively and methodologically. 

In our first and most important refined analyses, we 
used all available issue cases from all surveys. The de- 
pendent variables were the foreign policy preferences 
of all policy makers combined and (separately) the 
preferences of each of the three clusters of officials: 
those from the administration, the House of Repre- 


TABLE 1. 
(Model 1, Cross Sectional) 
Varlable 

Constant 


Public; 
Business, 
Labor; 
Experts; 
Adjusted R° 


Number of cases 
F significance 
df 


—0.067 
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sentatives, and the Senate. These analyses included 
only the four independent variables needed to test the 
principal theoretical expectations we have discussed, 
namely, the preferences of the general public, business, 
labor, and foreign policy experts. 

For the cross-sectional models, each of these inde- 
pendent variables (with the possible exceptions of ex- 
perts and the mass public) can reasonably be treated as 
exogenous with respect to government decision mak- 
ers. The policy preferences of business and labor lead- 
ers are arguably rooted in economic interests and in 
well-developed values; they are not likely to vacillate 
with the particular officials currently holding office. 

rts,” on the other hand, may be cultivated and 
even selected by officials. The foreign policy prefer- 
ences of the mass public may be influenced by govern- 
ment officials as well. This would lend ambiguity to the 
interpretation of large coefficients for the public, but 
(as we will see) no such coefficients have been found. 
Estimates of a lack of influence are largely free of causal 
ambiguity. 

Tables 1 through 4 show that our regression models 
were rather effective in accounting for the variation in 
policy makers’ preferences. Adjusted R? values were 
all high, ranging from .70 to .90 for Models 1-3 and 
only moderately lower (as would be expected when 
using change scores) for Model 4. Taken together, the 
preferences of business, experts, labor and the public 
can account for the bulk of variation in the foreign 
policy preferences of policy makers, both contempora- 
neously and over time. This is particularly true for all 
policy makers together, but nearly as much so for the 
separate groups of officials. 


Business Influence 


The strongest and most consistent results in Table 1 
are the coefficients for business, which suggest that 


Influences on Forelgn Pollcy Preferences of Government Officlais 


Administration 


Senate 

2.988 
(1.72) 

0.03 


House 


(1.16) 
0 10° 
(0.04) 
043” 
(0.05) 
0.19% 
(0.05) 


(0.06) 
0.43" 


(0.07) 
0.21" 


(0.07) 


Note. Entries are unstandardized coefficients from OLS regressions (standard errors in parentheses), with the 
percentage of government officials who took a given position as the dependent vanable and the percentages 
of members of each of the listed groups who took that position In the same year as Independent variables. 


= p) < .01 and * p < .05 (two-talled test) 
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TABLE 2. 


Varlable 


House 
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Influences on Forelgn Policy Preferences of Government Officials 


Senate Administration 


(Model 2, Cross Sectional with Lagged Dependent Variable) 


All Pollcymakers 
Constant 


Public; 
Business, 
Labor; 

Experts; 

Govt offictals? , 


Adjusted R? 
Number of cases 
F significance 

dt 


Note. Entnes are unstandardized coefficients from OLS regressions (standard errors In parentheses). The 
dependent vanable is the percentage of government officials who took a gwen postion, the Independent 
Vanables are the percentages of members of each of the listed groups who took that position or the percentages 
of government officials who took that posttion m the previous survey. * p < 01 and * p < .05 (two-tailed test). 
“We lagged the preferences of the particular set of government officials who were belng examined in the 
dependent vanable. For instance, when we used all policy makers in the dependent vanable, the preferences 
of all policy makers in the previous survey were included as an independent varlable. 


internationally oriented business corporations are 
strongly influential in U.S. foreign policy. According to 
the Model 1 cross-sectional analysis, when the support 
among business people for a given foreign policy is 
higher by 10 percentage points, the support among pol- 
icy makers (taken together) tends to be higher by about 
five points Business preferences were the strongest 
predictor of officials’ preferences within each of the 
three separate institutional arenas, peaking at a .71 co- 
efficient for administration officials (where it was more 
than double the only other significant coefficient, that 
for experts). To the extent that the executive branch 
dominates foreign policy, extensive influence by busi- 
ness on administration officials would be especially 
striking. ‘This finding is consistent with the expectations 
of many international relations scholars who focus on 
interest groups, but not with the expectations of those 
who envision an autonomous executive. 

Taking account of possible inertial forces in policy 
making by lagging the dependent variable, Model 2 
produces a similar story of relative strong business 
effects on all groups of policy makers separately or 
together, especially administration officials (Table 2). 
The consistently significant and moderately strong co- 
efficients for policy makers’ preferences lagged one 
period indicate that the contemporary views of policy 
makers are indeed influenced or conditioned by the 
history of already established perspectives. Business 
leaders, then, apparently exert some effect on govern- 
ment Officials that is quite independent of whatever 
impact they have also had on past policy that is carried 
forward through the self-reinforcing quality of previ- 
ous decisions. 

The Model 3 time series analysis in Table 3 provides 
a stronger test of whether the views of business “cause” 





changes in the foreign policy preferences of policy mak- 
ers, according to the logic of Granger models. Regress- 
ing the current preferences of all policy makers (as well 
as each separate group of officials) on the independent 
and dependent variables from the four-years-previous 
survey largely confirmed the major impact of business, 
with one wrinkle. These results suggest that business 
leaders are most influential on officials in the Senate 
and, especially, the administration, while possibly not 
exerting any meaningful effect at all on officials in 
the House of Representatives. As we have noted, 
Model 3 may actually underestimate the impact of busi- 
ness, if such impact occurs quickly, since part of that 
effect may be “controlled” away by the inclusion of 
a lagged dependent variable that already embodies it. 
Still, because of Model 3’s use of time asymmetries to 
pin down the causal direction of influence, the some- 
what lower business coefficients in Table 3 may repre- 
sent our best estimates. 

Model 4 offers a further perspective by using change 
measures (changes from one survey to the next in per- 
centage support for a given policy) for both the depen- 
dent and the independent variables. Table 4 shows that 
under this model, too, the basic pattern of substantial 
business influence holds. Business people (along with 
experts) are estimated to exert the strongest effects on 
policy makers overall and, especially, on administration 
officials. It indicates that when business people change 


D Our failure to find an effect that is clearly statistically significant 


is not, of course, the same as a definite finding of no effect. This 
ts particularly true in a case like the Model 3 estimate of business 
influence on the House, where the coefficient may be 

significant (at p < 10 by two-tailed test or p < .05 by one-tailed 
test). 
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TABLE 3. Influences on Forelgn Pollcy Preferences of Government Officlals 
(Model 3, Lagged Independent Variables) 


Varable All Policymakers House Senate Administration 
Constant 7.454" 


Publicy_; 
Business, 


Labort 


Experts, ; 
Govt. officialst , 


Adjusted R° 

Number of cases 

F significance 

gi -A D A A A A S A a 
Note. Entries are unstandardized coefficients from OLS regressions (standard errors in parentheses). The 
dependent vanable is the percentage of goverment officals who took a given positon; the Independent 
vanables are the percentages of members of each of the listed groups who took that position In the previous 
survey or the percentages of goverment officials who took that posttion in the previous survey “* p < 01 and 
*p < 06 (two-talled test), + p < .10 (two-tailed test equivalent to p < 05 by 1-tarled test) 

“ We lagged the preferences of the set of government officials who were belng examined in the dependent 
vanable. For Instance, when we used all pollcy makers as the dependent vanable, the preferences of this 
group of government officials in the previous survey were Included as an independent vanabie. 


TABLE 4. Influences on Forelgn Pollcy Preferences of Government Officlals 


(Model 4, Change Scores for Independent and Dependent Variables) 
Varlable All Pollcymakers, House, Senate, Administration, 
Constant 0.032 0.079 
(.085) 
0.34" 
(0.10) 
0.28" 
(0.08) 
0.11 
(0.08) 
0.37" 
(0.09) 


050 

212 
53.46" 

211 
Note: Entnes are unstandardized coefficients from OLS regressions (standard errors In parentheses) ™ p < 
.01 and * p < 05 level (two-talled test), + p < .10 (two-talled test equivalent to p < 06 by one-talled test). 
4 The independent and dependent vanables were calculated by subtracting the percentage of indmduals 
within a given group who took a given poston at Time 1 from the percentage of members of that group who 


Publlc, 
Business, 
Labor, 


Experts, 


Adjusted R? 
Number of cases 
F significance 
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took that position at Time 2 


their support for a given foreign policy by 10 percentage 
points, policy makers taken together tend during the 
same period to change their support by 3.2 points in 
the same direction, while administration officials alter 
their views by 4.6 points in a congruent direction. Such 
an institutional differentiation of business influence is 
foreseen by some previous research in international 
relations, which stresses the particular responsiveness 
to business of administration officials seeking to im- 
prove economic conditions and encourage voters to 
reward (rather than punish) the president as they retro- 
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spectively evaluate the incumbent’s performance. The 
lower R? statistics in Table 4 reflect the greater analytic 
challenge of explaining changes (Model 4) as opposed 
to levels (Models 1-3); the variance in both indepen- 
dent and dependent variables is much smaller and a 
higher proportion of it is random noise, especially for 
the aggregate preferences of relatively small groups 
like Senators and administration officials. 

A clear message from all four models is that busi- 
ness is a consistently effective influence on policy mak- 
ers. Its absolute and relative influence does appear to 
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vary, though, based on how the different models treat 
the passage of time. The contrast between the cross- 
sectional and the time-lag models suggest that business 
leaders may exert strong effects quite rapidly, perhaps 
through phone calls and other direct contacts with of- 
ficials. 


Experts 


Experts, judging by some of our models, also appear to 
be a potent force affecting the views of policy makers. 
According to cross-sectional Models 1 and 2, experts 
have the second strongest contemporaneous influence 
(next to business) on policy makers’ preferences (see 
Tables 1 and 2). This finding, which at least partly fits 
the expectations of researchers on epistemic commu- 
nities, applies with nearly equal magnitude to all policy 
makers taken together and also to those from each 
of the three institutional settings analyzed separately. 
But according to the time series analysis of Model 3, ex- 
perts did not exert any significant influence at all. This 
suggests either that experts’ impact is extremely quick, 
fully embodied in policy makers’ previously measured 
preferences, or (perhaps more plausibly) that the ex- 
pressed preferences of experts are largely effects rather 
than causes of policy makers’ stands. That is, the sub- 
stantial estimated effects in cross-sectional Models 1 
and 2 may be inflated by simultaneity bias. The same 
problem may affect the results of Model 4, which in- 
dicates that changes in experts’ and policy makers’ 
preferences over the same time period tend to go in 
the same direction. Since these changes are measured 
between the same two time points (previous and cur- 
rent surveys), we cannot be sure which change actually 
preceded or caused the other. Still, we cannot dismiss 
the possibility that experts and epistemic communities 
have substantial influence on officials. 


The Publlc 


Perhaps the most surprising finding in this whole set 
of analyses is the apparent weakness of public opin- 
ion. Even with these reduced and refined models, the 
public does not appear to exert substantial, consistent 
influence on the makers of foreign policy. In none of 
the 16 regression analyses, applying our four models 
to four sets of decision makers, was the public opin- 
ion coefficient dominant over those of the elite actors. 
The contemporaneous Model 1 estimate for the ef- 
fect of public preferences on officials in the House of 
Representatives was statistically significant but weak 
(b = .10); the public had no significant effects, even 
by a one-tailed test or the loose p < .10 standard, in 
Model 2.“ The negative coefficients for public opinion 


14 We are reluctant to rely on the loose p < 10 criterion for statistical 
significance because, when many tests are performed, it can produce 
a number of false positives By the same token, we hesitate to rely on 
one-tailed tests at the p < .05 level (equrvalent to two-tailed tests at 
p < .10), because we are not sufficiently confident that the possibility 
of real negatively signed mfluences can be altogether ruled out the- 
oretically. Stull, we must also avoid the possibility of false negatives, 
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in the time series analysis (Model 3), if taken seriously, 
actually indicate that—controlling for the past views 
of governmental and nongovernmental elites—officials 
tend perversely to move away from public opinion. 
A more plausible interpretation of these borderline- 
significant coefficients, however, is that the public sim- 
ply has no effect at all. According to Model 4, public 
opinion resurfaces as a notable but secondary influ- 
ence on policy makers overall and on House officials, 
with no significant effect on Senate or administration 
officials. But even this degree of public influence may 
be overstated by the simultaneous-change analysis of 
Model 4: If policy makers have reciprocal effects on the 
public, a simultaneity bias may inflate the estimates. 

Taken together, these findings hint at partial confir- 
mation of the Founders’ expectations about the House 
of Representatives as “the people’s” chamber. More 
importantly, however, they run against the thrust of 
much past research that has found substantial impacts 
of public opinion. 


Labor 


Labor leaders are estimated by Models 1 and 2 to exert 
contemporaneous influence on foreign policy decision 
makers, but only weakly (b = .16 for all policy mak- 
ers taken together in both Model 1 and Model 2), 
and mainly on the legislative rather than the executive 
branch. Research on the role of interest groups in the 
making of U.S. foreign policy generally anticipates just 
such a modest role for organized labor. What is surpris- 
ing, though, is that in the Model 3 time series analysis 
labor emerges as the second strongest influence (next 
only to the inertia effect of officials themselves) on 
policy makers as a whole, with particular impact on 
officials in the House of Representatives. Model 3, 
making use of time asymmetries, offers particularly 
strong evidence regarding causation. Labor, despite 
its limited contemporaneous influence, is apparently 
able—perhaps owing to its active presence in states 
and localities—to apply delayed pressure on govern- 
ment officials, especially House members, who may be 
particularly sensitive to organized pressure within their 
decentralized districts. (Another possibility, as we will 
see, is that labor indirectly impacts the national agenda 
through experts.) This evidence of delayed effects is 
consistent with the absence of significant labor coeffi- 
cients in Model 4, because delayed effects of changes 
in labor’s stands will presumably show up only later, 
not in simultaneous changes by policy makers. 

The analyses in Tables 1 through 4 form the bedrock 
of our empirical investigation. But we also used the 
same four regression models to explore possible vari- 
ations in influence on policy makers during differ- 
ent time periods and across three domains of for- 
eign policy—defense, diplomacy, and economic issues. 


eg, dismissing public opinion as having “no effect” just because 


its coefficients do not pass our relatively stringent tests. For that 
reason, all tables mclude (along with standard errors) formation 
on significance at the p < 10 level by two-tailed test, which readers 
are free to interpret as they will. 


117 


Who Influences U.S. Foreign Policy? 


Separate analyses for each of the eight survey years 
produced results that generally paralleled those we 
have displayed for all years combined, with some ap- 
parently random variation in coefficients due to the 
smaller numbers of cases. For the different issue do- 
mains, too, the same pattern of findings generally held: 
substantial business influence, contemporaneous but 
not lagged effects by experts, and delayed effects by 
labor, with little impact from the public. In the Eco- 
nomic realm, however, where foreign policy cuts close 
to home, public opinion did appear to have signifi- 
cant contemporaneous influence on policy makers: a 
modest coefficient of .17 in Model 1 but a more sub- 
stantial .36 in Model 2 and a remarkable .93 in same 
period-change Model 4 (all significant at p < 01). (In 
the Model 3 time series analysis, however, the public’s 
coefficient did not come close to significance.) With 
regard to defense policy there were some indications, 
consistent with previous literature (e.g., Snyder 1991), 
that business has a particularly strong effect on pol- 
icy makers. Because of the limited number of cases, 
interpretation of the issue-specific results requires 
caution. 


Searching for Public Influence 
on Government Officlals 


Given the surprisingly weak estimates of influence by 
public opinion (with the partial exceptions of effects 
on House members and in economic policy), we were 
concerned that real public influence might be masked 
by the inclusion in the analysis of labor leaders, whom 
we found to have a moderate influence on policy mak- 
ers and whose preferences were fairly strongly corre- 
lated with those of the public (r = .82). In order to 
provide the best possible opportunity for public opin- 
ion to display effects, we dropped the labor variable 
and regressed the preferences of policy makers only 
on the preferences of the general public, business, and 
experts. 

The proportion of variance accounted for was vir- 
tually the same as in the earlier analyses, indicating 
that the public and labor can indeed be substituted 
for each other without losing much predictive power. 
But even with these rather generous—and somewhat 
implausible—causal assumptions (namely, that labor 
leaders have no independent impact on foreign policy 
at all but merely act as statistical proxies for the gen- 
eral public), the estimated influence of public opinion 
on the preference of government officials remained, at 
best, modest. 

The Model 1 cross-sectional analysis excluding labor 
produced coefficients for public influence on all pol- 
icymakers (b = .11), officials in the House (.21), and 
Senate officials (.12) that were statistically significant 
and higher than in the earlier analysis But they re- 
mained smaller than the comparable coefficients for 
business (.50 for all policy makers, .40 for House offi- 
cials, and .40 for Senate officials) or experts (.40, .41, 
and .44, respectively; all significant at p < .01). Much 
the same thing (though with somewhat higher pub- 
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lic coefficients) was true of contemporaneous-change 
Model 4.5 And in Model 2 the estimated effects of 
public opinion mostly vanished, with only a low co- 
efficient for public influence on the “people’s House” 
(.12) remaining statistically significant. In Model 3, our 
touchstone for “real,” over-time influence, public opin- 
ion had no significant effects at all. None of the four 
models indicated any clearly significant public influ- 
ence on the administration, the main center for foreign 
policy decision making. 

Parallel labor-excluded analyses across the three pol- 
icy domains mostly continued to show the public as 
a minor influence, with less impact than experts or, 
especially, business, except on economic policies. 

Other results hinted at support for the 
Schattschneider (1960) view that the public has more 
influence on high-salience than low-salience issues, but 
estimates of public impact remained modest even for 
the very highest-salience issues, where—according to 
all four models—the estimated impact of both business 
and experts remained substantially greater than that 
of the public. 16 

In short, in spite of generous model specifications, 
the effect of public opinion on the preferences of for- 
eign policy makers appears to be—at best—modest, 
when critical competing variables are controlled for. 
In general, public opinion takes a back seat to business 
and experts. These results challenge research that has 
suggested a generally strong public impact on foreign 
policy. 

Still, we are not suggesting that public opinion has 
no effect at all on U.S. foreign policy. We have noted 
indications of significant public impact on members 
of the House of Representatives, on economic policy, 
and (perhaps) on issues of especially high salience. The 
public may play a substantial part in highly salient ques- 
tions of war and peace, such as those analyzed by Sobel 
(2001). Further, we have not explored possible effects 
by the public on agenda setting, or on decision makers’ 
anticipation of later, retrospective public opinion, or 
on the rhetorical packaging of policy choices (Jacobs 
and Shapiro 2000). (The makers of foreign policy may, 
for example, work hard to avoid high levels of military 
casualties that could provoke electoral punishment; see 
Mueller 1973.) But it is worth emphasizing that our 
measurements of public opinion, based on large sample 
surveys, are quite good; any attenuation of coefficients 
due to measurement error should affect elite groups 
more than the public. And this sort of result is not likely 
to result from model misspecification due to causal am- 
biguities. An erroneous finding of noninfluence by the 


15 In the Model 4 analysis excluding labor, coefficients for effects on 
all policy makers, House members, Senators, and the administration 
(respectively) were 32", 37, 31+, and 25+ for the public, 33", 
29, 21, and .45™ for business; and 41, 40, 44, and 42™ for 


experts. 

16 In regreamons hmited to the most highly salient issue cases (those 
with 4% or fewer “don’t know” responses), coefficients for the public 
neared or attained statistical significance only under Model 1 (.21**) 
and Model 4 (33+) The coefficients for different salience categones 
are highly unstable because of small Ns 
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public is considerably less likely to result from simul- 
taneity bias than is an excessively large estimate of its 
influence.’ 

Allin all, the implications of our findings for previous 
research connecting public opinion and policy making 
are sobering. 


Indirect Business and Labor Influence 
through Experts? 


The substantial influence of experts suggested by our 
cross-sectional models and the simultaneous-change 
Model 4, though in line with past research on epistemic 
communities, is subject to doubt about its causal status. 
Even if one sets aside the possibility of reciprocal influ- 
ences by officials on experts that would tend to inflate 
their apparent impact on officials (a possibility high- 
lighted by the null findings from the Model 3 over-time 
analyses), there remains the question of whether the 
preferences of experts are a truly independent variable 
or whether they function in an intervening role. For 
example, experts might themselves be influenced by 
business or labor and, in turn, transmit the preferences 
of those groups to officials. 

Researchers on epistemic communities sometimes 
assume that policy makers are guided by indepen- 
dent, objective analysis from new “knowledge-based 
elites”: that experts are not merely vehicles for press- 
ing Officials on behalf of organized interests. Yet the 
widespread funding of think tanks by business—and, to 
a much lesser extent, by organized labor—suggests that 
interest groups may sometimes affect who becomes 
a recognized expert and what such experts say. Thus 
experts might not be autonomous influences on pol- 
icy makers but, instead, might—in whole or in part— 
convey to officials the preferences of others. 

Without additional data we cannot hope to defini- 
tively untangle these causal complexities, But we al- 
ready know, from our discussion of a possible “foreign 
policy establishment,” that the policy preferences of 
experts are not statistically independent of the prefer- 
ences of business or labor. They are quite highly cor- 
related. The foreign policy preferences of think tank 
respondents, for example, were correlated at r = .90 
with the contemporaneous preferences of business and 
r = .80 with those of labor. 

If we assume one-way causation from business and 
labor to experts,!® we can go further and estimate the 
independent effects of each, through regressions in 


17 Reciprocal effects of government officials on public opmuon, 
for example, would bias our croes-sectional and contemporaneous- 
change estimates of the public’s influence on officials upward rather 
than downward. 

18 We consider the assumption of one-way causation from business 
and labor to experts to be fairly plausible, because the deeply rooted 
economic interests of business and labor are likely to make them 
resistant to others’ influence on their foreign policy preferences Still, 
mterest groups may provide financial support for experts in order to 
get serious advice (not just to persuade other audiences) and may, 
therefore, sometimes be influenced by them. To the extent that this 
occurs, our estimates of effects upon experts will be biased upward 
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which experts’ preferences are the dependent variable. 
The results, shown in Table 5, are quite striking. The 
cross-sectional Model 1 analysis indicates that busi- 
ness had a highly significant and quite high coefficient 
(b = .61, p < .001), and labor was not far behind (b = 
.42, p < .001). Taking into account the self-reinforcing 
quality of experts’ previous views (with Model 2), busi- 
ness’ estimated effect on experts was nearly twice the 
magnitude of labor’s (b = .48 versus .27). Much the 
same was true in Model 4’s simultaneous-change anal- 
ysis. In the time series analysis of Model 3, however, 
the four-year-lagged views of business showed no sig- 
nificant effect, which suggests that business’s influence 
on experts (if any) is felt relatively quickly. 

The results in Table 5 suggest that organized groups 
do influence experts and that this impact is quite sub- 
stantial. If we set aside the time series findings of 
no expert effects on policy makers, and accept the 
cross-sectional estimates of substantial contemporane- 
ous effects, we can go on to estimate indirect effects 
that business and labor may have on public officials 
through their influence on experts. For example, if (as 
the Model 1 cross-sectional analysis in Table 5 indi- 
cates) a 1-percentage point increase in business sup- 
port for some policy generally leads to a .61-percentage 
point increase in experts’ support for that policy, and 
if (as Table 1 indicates) such a .61-percentage point in- 
crease for experts would then lead to a .183 (.30 x .61)- 
percentage point increase in officials’ support, business 
would obtain a small but not irrelevant increment of 
indirect clout. This estimated indirect impact of .183 
can be added to the direct impact of business on policy 
makers (estimated at .52 points in Table 1), yielding 
a total business impact of slightly over .70. That is to 
say, a 10-percentage point increase in business sup- 
port for a given policy may lead to a 7-point, rather 
than >-point, increase in Officials’ support for that 
policy.! 

Similar calculations for labor based on Model 1 
cross-sectional analyses indicate a .126 indirect effect 
of labor on officials. Adding that to the estimated direct 
effect of .16 given in Table 1, the total impact of labor 
on foreign policy officials is about .29, nearly double 
the size of the direct effect alone.” This amounts to 
a surprisingly strong impact, given labor’s reputation 
for general ineffectiveness in American politics. It is an 
impact that appears to occur with some delay over time 
(recall Table 3) and to depend significantly on indirect 
influence through experts. Even so, the estimated total 
effect on foreign policy of business is more than twice 
that of labor. 

Methodological uncertainties mean that we should 
not take the magnitudes of these estimates as gospel. 


19 By smilar calculations, Model 2 (including a lagged dependent 
variable) produced estimates of a total business impact on policy 
makers of 56: an indirect rmpact through experts of .115 ( 24 x .48) 
pios a direct ımpact of .44 (see Tables 2 and 5). 

According to Model 2 analyses with a lagged dependent variable, 
labor has a slightly smaller total mmpact of .22: a .06 indirect effect 
through experts (.24 [Table 2] x 27 [Table 5]), combined with a direct 
effect of .16 (Table 2). 
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TABLE 5. Effects of Business and Labor on the Forelgn Policy Preferences of Experts 


Model 1. 


Experts 


Model 3. Analysis with 


Cross-Sectional 
Analysls 
—0.774 

(1.05) 
0.42" 
(0.03) 
0.61% 
(0.03) 


Constant 0.216 


(1.28) 
Labor; 
Business; 
Labor; 
Business; 
Experts: 
Labor, 
Business, 
Experts. 
Labor, 


Business, 


Adjusted AR? 0.87 0 92 
Number of cases 482 212 


1578.74" 768.47" 
481 211 


F significance 


Model 2. Analysis 
with Lagged 
Dependent Vanable 


Model 4. Analysis 
with Change 
Scores 


Independent and Dependent 
Variables Lagged 
6.351" 

(1.81) 


0.82 
212 
319.74 
211 


Note Entnes are unstandardized coefhcents from OLS regressions (standard errors in parentheses), with the Independent and 
dependent vanables measured as the percentage of each group taking the a grven positon on a given Issue or, in the case of Model 4, 


changes In those percentages “* p < .01 (two-tailed test). 


Still, they pose a challenge for those who consider the 
views of experts to be altogether autonomous. 


FOREIGN POLICY, THE NATIONAL 
INTEREST, AND DEMOCRACY 


Our analyses using four distinct types of statistical mod- 
els indicate that internationally oriented business lead- 
ers exercise strong, consistent, and perhaps lopsided 
influence on the makers of U.S. foreign policy. The 
estimates of strong business influence hold up under 
different models, for different political and institutional 
conditions, and for different time periods. They hold for 
high- as well as low-salience issues, for a variety of sub- 
stantive issue areas, and with respect to different insti- 
tutional groups of policy makers (though especially for 
executive branch and Senate officials). These findings 
tend to confirm theoretical expectations and empirical 
research from the organized interest group literature 
in international relations. 

The estimated impacts of experts on policy makers 
do not generally match those of business, but they, too, 
are quite substantial, according to three of our four 
models (though not the Model 3 analyses of impact 
over time). This lends some credence to claims by 
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analysts of epistemic communities. Our further inves- 
tigation of who influences experts suggests, however, 
that organized groups—not just independent, objec- 
tive evaluations of complex international realities— 
may color the views of experts. These findings indicate 
that the direct foreign policy clout of business and labor 
may be augmented by an indirect influence on policy 
makers that works through experts. 

Labor, even taking into account its possible indi- 
rect influence through experts, has less impact on the 
makers of U.S. foreign policy than business does. This 
finding, too, fits with some previous work by interna- 
tional relations scholars who focus on organized in- 
terest groups. Nonetheless, labor leaders do appear to 
exert a surprisingly consistent (if secondary) influence 
on policy makers, especially on members of the House 
of Representatives and concerning particular types of 
issues. 

The findings from our cross-sectional and simul- 
taneous-change analyses, as contrasted with the time 
series analyses using lagged independent variables, 
suggest that the influence of business tends to be 
fairly quick, while labor’s influence tends to be de- 
layed and is exerted over time as labor makes its pres- 
ence in congressional districts felt. Although labor’s 
effect on foreign policy has been downplayed by some 
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international relations scholars, our evidence of la- 
bor’s impact is consistent with recent reevaluations 
of its nested impact on US. social welfare policy 
(Gottschaulk 2001). 

To our surprise, public opinion—the aggregate for- 
eign policy preferences of ordinary citizens—was re- 
peatedly estimated by our Models 1-3 to have little 
or no significant effect on government officials. When 
effects of public opinion emerged in these models 
(namely, on economic issues, on members of the House 
of Representatives, and perhaps on very high-salience 
issues), the absolute and relative magnitudes of effects 
were generally modest. These findings of little influ- 
ence by public opinion are generally immune to issues 
of model specification and causal ambiguity that may 
affect some of our other results. On the other hand, 
the moderately strong estimates of public influence us- 
ing Model 4 (based on same-period changes) may be 
inflated by simultaneity bias and should probably be 
taken as establishing an upper bound on public effects. 
Even those estimates were generally smaller than for 
business or experts. These results contradict expecta- 
tions drawn from a large body of previous research. 

If one accepts the survey-measured aggregate policy 
preferences of foreign policy decision makers as satis- 
factory indicators of actual policy (and we have argued 
that it is reasonable to do so), our findings have several 
important implications for understanding U.S. foreign 
policy, international affairs, and American democratic 
governance. 

First, these findings underline the continuing im- 
portance of moving from monocausal explanations 
to multi-causal explanations in international relations 
scholarship (see Keohane 1989 and Putnam 1988). 
Although recent work in international relations (e.g., 
Moravesik 1997; Russett and Oneal 2001; Snyder 1991) 
has shown substantial progress in this respect, there is 
still a pressing need for comprehensive examinations 
of multiple possible determinants of U.S. foreign policy 
across a range of issues. 

Our evidence suggests that business may exert the 
most consistent influence on government officials but 
that policy makers’ views may also be affected by labor, 
experts, and, to a lesser extent, public opinion. These 
results suggest that three of the most prominent lines of 
analysis of foreign policy—the interest group, epistemic 
community, and public opinion approaches—each have 
some merit. But at the same time, research along each 
of these lines has tended to omit critical alternative 
variables. It has seldom systematically examined the 
relative impact of competing influences. This risks arti- 
ficially inflating estimates of the importance of partic- 
ular factors of interest. 

This hazard seems particularly serious for quantita- 
tive analyses of the effect of public opinion, not only 
on foreign policy (see, however, Hartley and Russett 
1992 and Ostrom and Marra 1986, which take steps 
to be multivariate) but also on U.S. domestic policy. 
For instance, Erikson, MacKuen, and Stimson’s widely 
discussed Macro Polity (2002) concludes that domes- 
tic policy is highly responsive to public opinion, yet it 
comes to that conclusion without considering the possi- 
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ble impact of organized interests. In our analyses, a very 
strong bivariate relationship between public opinion 
and the preferences of policy makers crumbled away 
almost completely when we included data on organized 
interests and experts in multivariate regressions. To be 
sure, foreign policy may be quite different from do- 
mestic (Art 1973; Wildavsky 1991), but we cannot be 
certain of this until comparable research is conducted 
that includes relevant variables and potentially com- 
plex causal dynamics (Bartels 2002; Jacobs and Shapiro 
2000; Page 2002; see also Druckman, Jacobs, and 
Ostermeier 2004 and Jacobs and Burns 2004). 

Second, our results have some troubling normative 
implications. The apparently weak influence of the 
public will presumably disappoint those adherents of 
democratic theory (e.g., Dahl 1989) who advocate sub- 
stantial government responsiveness to the reasoned 
preferences of citizens (Page and Shapiro 1992). Our 
findings indicate that the gravitational pull on foreign 
policy decision makers by the “foreign policy establish- 
ment” (especially business leaders and experts) tends 
to be stronger than the attraction of public opinion. 
This is consistent with the pattern of extensive and 
persistent “gaps” that Chicago Council studies have 
found between the foreign policy preferences of the 
public and those of policy makers. For example, or- 
dinary Americans, more than policy makers or other 
elites, have repeatedly expressed stronger support for 
protecting Americans’ jobs, stopping the inflow of il- 
legal drugs, and reducing illegal immigration, as well 
as for a multilateral, cooperative foreign policy based 
on bolstering the United Nations, working closely with 
allies, and participating in international treaties and 
agreements (Bouton and Page 2002; Jacobs and Page 
2003 


Comparatively muted influence by the mass public 
might initially please classical realist critics of citizen 
input into foreign affairs, who consider public opin- 
ion to be ill-informed and capricious (e.g., Lippmann 
1955). Yet those same critics would want to see policy 
makers rise above the politics of organized interest 
groups in order to pursue the “national interest,” per- 
haps as identified by independent, objective experts. 
Our finding of a substantial impact on foreign policy by 
business—generally a greater impact than by experts— 
suggests that purely technocratic calculations do not 
always predominate in the making of foreign policy. 
Competing political interests continue to fight over the 
national interest, and business often wins that compe- 
tition. 
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Welfare and the Multifaceted Decision to Move 
MICHAEL A. BAILEY Georgetown University 


ther poor single mothers move in response to welfare benefits has important implications for 

social policy in a federal system. Many scholars claim that welfare does not affect migration. 

These claims are not definitive, however, because the models underlying them rely on problem- 

atic assumptions and do not adequately control for nonwelfare determinants of migration. I address these 
shortcomings with an improved statistical model of individual-level migration. The results indicate that 
welfare does affect residential choice. Although the effects of welfare are much smaller than the effects 
of family ties, they are real and have the potential to cause nontrivial changes in welfare populations and 


welfare expenditures. 


or move to states with higher welfare benefits? 

This question has important implications on at 
least two levels. As a policy matter, the answer will 
enlighten us about the effects of welfare on society 
and will assist efforts to understand whether states 
systematically lower benefits in order to avoid becom- 
ing “welfare-magnets” (Peterson and Rom 1989; Rom, 
Peterson, and Scheve 1998; Volden 2002). This answer 
will also help us anticipate how welfare policies may 
evolve in other areas of the world where it is—or 
is becoming—as easy to move across jurisdictional 
boundaries as in the United States. 

As a theoretical matter, understanding the relation- 
ship between welfare and migration can help us bet- 
ter understand how the increasing mobility of people, 
firms, and capital affects governmental capacities to 
provide welfare and other redistributive benefits. If 
generous government benefits prompt people who re- 
ceive them to flow in and the people who pay for them 
to flow out, the benefits will become increasingly diffi- 
cult to sustain. This is true not only for a federal system 
such as the United States, but also for the international 
system in which political, social, and economic barriers 
to migration have fallen dramatically in recent years. 
Finding that welfare-induced migration occurs in the 
United States would enhance concern about govern- 
mental capacity for social services; finding no such be- 
havior, on the other hand, would make us less inclined 
to believe that migration constrains governments in 
other, less likely contexts. 

Lately, most scholars researching the question have 
found very little or no support for the idea that wel- 
fare affects migration in the United States (Allard and 
Danziger 2000; Levine and Zimmerman 1999; Schram, 
Nitz, and Krueger 1998; Schram and Soss 1999). They 
conclude that the evidence is “at best mildly in favor” 
of the idea that welfare affects migration (Brueckner 


N re poor single mothers more likely to stay in 
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2000, 519) or, more typically, that welfare-induced mi- 
gration is a “myth” (Allard and Danziger 2000; Schram 
and Soss 1999, 83). However, we should be cautious 
about accepting this emerging conventional wisdom. 
In a variety of ways, these studies fail to account for 
the complexities of migration and consequently run 
the risk of either obscuring the effect of welfare or, 
even worse, conflating the effect of welfare with the 
effects of other unmeasured factors. As I show be- 
low, this is precisely what has happened, the case in 
point being the link between welfare benefits granted 
through the Aid to Families with Dependent Children 
(AFDC) program in the late 1980s and the residential 
choices of poor single mothers, the program’s primary 
constituency. 


THREE HOLES IN THE EXISTING 
LITERATURE 


Assessing whether welfare affects migration is no sim- 
ple task, as attested by a vast, highly contested liter- 
ature (for a review, see Brueckner 2000). To do this 
convincingly, researchers must account for all the fac- 
tors other than welfare that affect people’s decisions to 
move. Researchers have made progress in this respect, 
but problems persist. Three issues undermine the re- 
cent wave of research that downplays or dismisses the 
effect of welfare on migration. 

First, many studies risk distorting the effect of wel- 
fare by inadequately accounting for state attributes that 
affect migration. These studies typically consist of sta- 
tistical analyses of either aggregated migration flows 
or individual migration choices. They control for state- 
level influences on residential choice through variables 
measuring such attributes as state economic perfor- 
mance and differences in state climates (e.g., Allard 
and Danziger 2000; Frey et al. 1996; Schram, Nitz, and 
Krueger 1998). 

This seemingly straightforward enterprise is actu- 
ally remarkably difficult. Consider, for example, the 
variables Schram, Nitz, and Krueger use to charac- 
terize nonwelfare components of state attractiveness. 
Florida—the quintessential high—population growth 
state—averaged 55% unemployment from 1985 to 
1990; its median income averaged $34,931 in nominal 
terms and $29,090 in state cost-of-living adjusted terms. 
Many states that were less attractive to potential 


125 


Welfare and the Multifaceted Decision to Move 


in-migrants looked similar or better in these terms: 
Rhode Island averaged 3.9% unemployment and 
$38,492 in nominal median income. South Dakota 
averaged 4.4% unemployment and $30,460 in cost- 
adjusted median income. Of course, one could add 
variables (e.g., “average temperature,” “murder rate”) 
and all manner of nonlinearities and interactions (e.g., 
“temperature squared,” “temperature x income”). 
Nonetheless, one cannot help but suspect that signifi- 
cant aspects of state attractiveness resist measurement. 

The danger is that studies with inadequate state-level 
controls will conflate the effect of welfare on migration 
with other factors. Recent demographic trends make 
this a particular concern. Americans tend to move 
from northern (“rust belt”) states with relatively high 
welfare benefits to southern (“sun belt”) states with 
relatively low welfare benefits. Failing to account for 
the complicated mixture of economic and social fac- 
tors behind such moves results in analyses in which 
the states where welfare is high are also the states 
where the unmeasured attractiveness of living is low, 
and vice versa. The statistical result is that unmea- 
sured disincentives to migrate to a state get lumped 
in with the observed (and correlated) welfare mea- 
sures, leading to estimates in which the effect of wel- 
fare appears to be small or inconsequential, even if it 
is not. 

Second, existing research fails to account adequately 
for individual-level factors that influence migration. 
Many individuals want to move “home” to the state in 
which they were born because, that is, where they are 
more likely to have family and to know the neighbor- 
hoods, schools, and industries. Moving home may have 
a particularly powerful appeal for single mothers, who 
often depend on the housing, childcare, financial assis- 
tance, and psychological support of parents, siblings, 
and friends (Allard and Danziger 2000, 358; Schram, 
Nitz, and Krueger 1998; Vartanian et al. 1999). In what 
follows, I refer to the attractions of home as “family 
ties”; some scholars refer to them as “social capital.” 

The data described below bear out these expecta- 
tions. Home is not just another variable; it is a funda- 
mental influence on migration. Fully one-third of all 
interstate moves by poor single mothers were to the 
individuals’ birth states. For many states, the propor- 
tion of in-migrants who were born there is extremely 
high: 54% of poor single mothers moving into Alabama 
from out of state had been born in Alabama. The 
comparable numbers were 57% for Louisiana, 58% 
for Mississippi, and 51% for West Virginia. (At the 
other extreme, only 12% of poor single mothers mov- 
ing into Florida or Nevada were returning to their state 
of birth.) 

Failure to account for the special attractiveness of 
birth states can lead researchers to understate or even 
reverse the true effect of welfare on migration. The 
reason is that single mothers were born disproportion- 
ately in poor, low-benefit states. If we fail to control 
for the attraction of home states for these women, we 
may mistake their fairly common moves home with a 
complete disregard for the low welfare benefits in their 
states of birth. 
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The omission of race in many studies raises simi- 
lar concerns. Individuals are more likely to move to 
states with larger numbers of racially similar people 
(Frey et al. 1996). In the data discussed below, there 
are about 60,000 poor white single mothers and about 
40,000 poor black single mothers, Of the whites, 510 
lived in North Dakota, South Dakota, or Vermont; two 
of the black single mothers lived in those states. If race- 
specific attraction to states correlates with welfare (as is 
likely if, for example, African Americans are relatively 
more attracted to low-benefit southern states that have 
relatively large African American populations), failure 
to account for such variables may introduce yet another 
source of omitted variable bias that can distort the es- 
timated effects of welfare on migration. 

Third, many studies aggregate away important state- 
level differences. Levine and Zimmerman estimate a 
model in which the dependent variable is whether an 
individual moved out of state. By ignoring whether 
the person left for a high-benefit state such as 
California or a low-benefit state such as Louisiana, 
this approach limits the ability of the method to as- 
certain the role of welfare. Meyer (2000) estimates 
a model in which migration across regions is the de- 
pendent variable, thereby treating states as identi- 
cal within regions. Depending on the specification, 
Meyer assumes that there are two or nine regions in 
the entire United States, implying, for example, that 
Maryland is identical to West Virginia and that New 
Hampshire is identical to New York. This assump- 
tion of intraregional homogeneity creates a chronic 
error in variables problem that likely will obscure 
relationships between variables such as welfare and 
migration. 

Each of these problems potentially obscures or dis- 
torts the estimated effect of welfare on migration. Ev- 
ery recent study that dismisses welfare effects suffers 
from more than one of these problems, meaning that 
the true effect of welfare is buried under multiple layers 
of specification error. To get a better sense of the true 
relationship between welfare and migration, I develop 
an analytical approach that directly addresses each of 
these issues. 


A MORE COMPREHENSIVE APPROACH 


At the heart of the analysis is a random utility model of 
individual-level migration choices. The model charac- 
terizes the utility for every individual of living in every 
single state. Specifically, the utility of living in state s 
for person i currently in state j consists of a deter- 
muinistic component vy, and a stochastic component 


eae 
Uss = Uys + Eys. (1) 


I estimate the model with a conditional logit setup 
(Greene 2000, 858). In the model, each individual se- 
lects the state that offers the highest utility. Assuming 
that the random shocks are independently and identi- 
cally distributed Extreme Value Type I random vari- 
ables, the probability that person : living in state j 
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chooses state s is 
Pys = Prob(Uys > Usk VEX 5), 2) 
= Prob(éy — Eys < Uys — UykY k Æ s), (3) 
eu 
= 4 
SE em (4) 


where K is the total number of states to which an 
individual can move. The computationally convenient 
form makes estimation conceptually straightforward 
(even as it is practically difficult, given that a very 
large number of individuals are choosing among a large 
number of discrete choices).’ By explicitly modeling all 
the state choices, I reduce the possibility that errors in 
variables obscure the effect of welfare on migration. 

I control for state attributes by using state-level fixed 
effects, i.e., by using state-level dummy variables to 
control for all state attributes that are the same for all 
individuals in the analysis. For example, these variables 
control for state unemployment and state climate be- 
cause for any given state, the values of these variables 
will be the same when modeling the probability that 
any individual will move to the state.” (A variable not 
encompassed by fixed effects varies for a given state 
across individuals; for example, only some people were 
born in New York, meaning that when modeling the 
utility of New York, the state-of-birth variable would 
be one for some individuals and zero for others.) The 
real advantage of fixed effects comes from their ability 
to subsume unmeasured variables and unspecified in- 
teractions. That is, fixed effects control for any attribute 
of a state—measurable or not—that affects all individ- 
uals in the same way. Thus the fixed-effect approach 
controls for state-level factors at least as well as—and 
usually better than—any approach relying on state- 
level covariates. 

When using fixed effects, one must make special ef- 
forts to distinguish the effect of welfare on migration— 
if any exists—from the more general attractiveness of 
states measured by fixed effects. If the sample includes 
only poor single mothers who are all eligible for Aid 
to Families with Dependent Children (“AFDC”), then 
the welfare generosity of each state will be the same 
for all individuals in the sample (in the same man- 


1 The model automatically satisfies the “independence of mrelevant 
alternatives” (ILA) condition. This condition implies that the ratio of 
probabilities of choosing one option to another 1s the same, whether 
or not a third option is included m the choice set. In an appendix 
available upon request, I discuss alternative estimation strategies 
and present results that indicate that the results are very similar m 
models that do not satisfy the ILA condition. 

2 To see this, first that the utility of a state depends only on a 
single variable (say “unemployment rate”) and that the coefficient on 
this variable is negative one. For every mdividual, the utility of lying 
In any given state would be negative one trmes the unemployment 
rate for the state A state-level fixed effect completely captures this 
amount If we add another state-level variable with a coefficient of 
two, say, the utility for all individuals of living in the state would be 
negative one times the unemployment rate plus two times the value 
of the new variable. Agam, a state-level fired effect would capture 
the utility value of a state. This reasoning directly extends to any 
number of state-level variables. 


ner that state unemployment and state climate were 
above). The state fixed effect will “soak up” the wel- 
fare effect and leave it statistically unidentified. I avoid 
this problem by using a quasi-experimental research 
design, sometimes referred to as a comparison group 
method (Levine and Zimmerman 1999; Meyer 2000). 
This design requires that I include in the sample a 
“control group” that is not eligible for welfare but oth- 
erwise resembles the “treatment group” of poor single 
mothers. General state attributes (captured by state 
fixed effects) influence individuals in the control and 
treatment groups; welfare, however, influences only in- 
dividuals in the treatment group. Given the inclusion 
of the control group, the welfare variable is no longer a 
constant for all individuals for any given state (that is, 
welfare benefits are zero for individuals in the control 
group and the measured value for individuals in the 
treatment group). The welfare variable now is statisti- 
cally identified; it allows us to see whether differences 
in welfare benefits explain any differences in behavior 
by the treatment and control groups. 

I also control for, among other factors, the gravi- 
tational pull of birth states and potential differences 
in the attraction whites and African Americans have 
toward states. Including these variables not only serves 
important statistical control purposes, but also human- 
izes the analysis by moving beyond the caricature of 
welfare recipients as solely motivated by financial gain 
(see the excellent discussion on this point in Schram, 
Nitz, and Krueger 1998). 


DATA 


Individual-level data are from the Census Bureau’s 
Public Use Microdata Series (PUMS) 1990 5% 
sample as accessed via Integrated PUMS (IPUMS) 
(Ruggles et al. 1997). This data set provides individual 
information on age, marital status, number and ages 
of children, income, race, education, birth state, and 
state of residence in 1985 and 1990. I work with the 
1990 data for two reasons. First, most studies of welfare 
migration assess AFDC in the late 1980s or early 1990s. 
I use data from that period in order to ensure that it 
is the methods—and not changes in reality—behind 
any new results. Second, the highly variable welfare 
environment from 1996 to 2000 makes it hard to draw 
inferences about migration based on average levels of 
benefits over that time period. In contrast, AFDC was 
quite stable from 1985 to 1990. 

The welfare population consists of 110,243 single 
mothers between 25 and 53 with children between 4 
and 18 who had an income less than 125% of the 
poverty leveL? Of these, 8.9% moved across state 


3 The earlier literature sometimes focuses on indrviduals who ac- 
tually recerve welfare. Meyer (2000, 5) details how doing so biases 
the results m favor of the welfare migration hypothesis. For exam- 
ple, some of the people who would not receive welfare in a low- 
benefit state could move to a higher-benefit state and receive welfare 
simply because ehgibility is easier in the higher-benefit state. This 
dynamic will exaggerate the flow of welfare recipients into high- 
benefit states and nonrecipients out of low-benefit states. Thus paper 
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lines between 1985 and 1990. The nonwelfare control 
groups reasonably match the welfare population in 
all respects except for eligibility for welfare. Follow- 
ing Meyer (2000) and Levine and Zimmerman (1999), 
I use three different control groups. The first con- 
sists of 69,270 childless single women between 25 and 
53 years of age who had less than three times the 
poverty income and no college degree.* The second 
control group consists of 96,684 childless single males 
who had less than three times the poverty income and 
no college degree. The third control group consists of 
122,681 married women with children with household 
incomes greater than three times the poverty level 
and below the lesser of five times the poverty level or 
$50,000. No group perfectly matches the welfare pop- 
ulation, but all match in some way the skill profiles and 
economic circumstances of poor single mothers. Using 
multiple specifications should increase confidence in 
the robustness of the results. 

The focal variable is welfare benefits measured as 
the sum of maximum AFDC benefits for a family of 
four and Food Stamps for each state.° The Food Stamp 
data are from the U.S. House Committee on Ways and 
Means (various years). I restrict the welfare effect to 
be zero for the control group by multiplying welfare 
benefits times a dummy variable indicating whether 
an individual is in the welfare population. This creates 
within-state individual-level variation in the welfare 
variable and allows it to be included in a model with 
state-level fixed effects. This is the critical variable for 
the welfare migration hypothesis. 

State wage data are the average retail wages for 
food stores from the Census Bureau (2000); data from 
this sector of the economy reflect the earnings poten- 
tial of low-skill women (Berry, Fording, and Hanson 
2003). State unemployment data are from the Bureau 
of Labor Statistics (2001). All state-level variables are 


follows Meyer’s recommendation (8) of using an “at-risk group (sm- 
gle mothers or, better yet, low-educated single mothers).” He also 
notes that “a substantial fraction of any at-risk group may not be 
likely welfare recuplents, and thus effects on the overall group are 
likely to be watered down estimates of the effects on likely partic- 
pants” Given the findings of this paper, it is reassurmg that the 
welfare population is identified ın a manner that biases agamst the 
welfare migration rather than in favor of ıt. Following the convention 
of this literature, I include only individuals who started and ended 
up in the continental United States, mcluding Alaska and Hawaii 
produces essentially the same results. The limits on children’s ages 
lmmit the sample to only those women who had children dunng the 
entire period from 1985 to 1990, earlier versions of this paper allowed 
for younger children and had smular resulta. The poverty level varies 
based on number of children m the family and other factors, the 
average poverty threshold in 1989 was $12,674 (IPUMS codebook 
[Ruggles et al. 1997], 225). 

For all control groups I exclude individuals who have served ın the 

military in the last five years, as ther mobility may be very different 
from that of civilians Also, I exclude disabled mdividuals from the 
control groups, as they may be more eligible for, or more interested 
ın, welfare than others in the group. 
5 This is the standard measure of welfare generosity in the literature 
Other aspects of welfare generosity such as eligibility standards are 
correlated, but distinct. See Bailey and Rom 2004 for further discus- 
sion of the multiple dimensions of welfare generosity Estimating the 
model using a measure of spending per poor person—a measure that 
taps eligibility as wellproduces simular results 
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averaged across 1985—90. I adjust for cost-of-living dif- 
ferences using Meyer’s (2000, 14) state price index 
(which focuses on variation in housing costs) and the 
national consumer price index. 

I control for moving costs with several variables. I 
measure the “fixed cost” of moving across state lines 
with a dummy variable called move that takes on a 
value of one if j # s. I measure the “variable cost” of 
moving, which depends on the distance of the move, 
with a variable called distance, which is the log of dis- 
tance between state s and state j. Interaction terms 
allow for the possibility that the effect of moving costs 
differs between the welfare and the nonwelfare popu- 
lations. 

I also control for the possibility that the welfare and 
nonwelfare populations respond differently to state 
characteristics. For example, individuals in the welfare 
population may care less about wages and unemploy- 
ment if they are expecting to rely on government or 
family assistance. Therefore I include interactions of 
state-level wage and unemployment variables with an 
individual-specific indicator variable for individuals in 
the welfare population. Although the general effects of 
wages and unemployment are not identified (because 
they are soaked up by the fixed effects), I can estimate 
the differential effect of these variables on the con- 
trol and treatment populations with these interaction 
terms. 


RESULTS 


The analysis proceeds in two steps. First, I replicate 
and extend the analysis by Schram, Nitz, and Krueger 
(1998) to make two points: (1) that models with no 
or few nonwelfare controls show no welfare effects 
and (2) that better accounting for state-level and 
nonwelfare determinants of migration produces initial 
evidence of a welfare effect. I then present results for 
the more flexible and powerful random utility model 


of migration. 


Revisiting Schram, Nitz, and Krueger 


Schram, Nitz, and Krueger (1998) model migration 
patterns of poor single mothers as a function of wel- 
fare, income, and employment differentials. (Allard 
and Danziger (2000, 361) provide, among other anal- 
yses, a similar analysis with no controls.) In two of 
four specifications, Schram, Nitz, and Krueger find a 
significant negative relationship between welfare ben- 
efits and migration. This odd result suggest either that 
high welfare benefits repel poor single mothers (which 
seems unlikely and would constitute a major paradigm 
shift if true) or that nonwelfare factors correlated with 
welfare benefits have been omitted and are causing a 
spurious negative relationship. 

To investigate whether omitted variable bias is the 
problem, Table 1 revisits Schram, Nitz, and Krueger’s 
model. The dependent variable is Census Bureau data 
on the proportion of poor, single women with chil- 
dren moving from one state to another between 1985 
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TABLE 1. Determinants of Interstate Mobil 


All State Dyads 


Welfare benefits difference 
Unemployment difference 
Income difference 
Nonwelfare migration 


Intercept 


Observations 
Re 


Vol. 99, No. 1 


Rates of Poor, Single Mothers 


Interstate Move Dyads Only 


2 


0.0001" 
(2.86) 
0.0001" 
(3.81) 
0.0001" 
(3.69) 
0.83" 

(57.58) 
0.0006"" 
(8.15) 


2,256 
0.596 


Mote Figures are OLS coefficients for a model in which the dependent variable Is the proportion of poor single mothers moving 
from state ; to state k for all continental state pairs; see text for details #statstcs are in parentheses *p < 0.05; "p < 001, 


mo < 0001. 


and 1990 for all state pairs. The independent variables 
measure welfare, unemployment, and income differ- 
ences. Column 1 reports results for a sparse specifica- 
tion as in Schram, Nitz, and Krueger. The results echo 
theirs: no variable is significant and the R* is hardly 
measurable. Column 2 reports results when J add con- 
trol for non—welfare state attractiveness with a vari- 
able measuring the proportion of poor, non-college- 
educated single women without children who moved 
from state j to state A The same economic, social, and 
cultural attributes of states affect these women and the 
welfare population with one important exception: the 
women without children were not eligible for AFDC. 
Their migration patterns therefore embody (and con- 
tro] for) the nonwelfare attractiveness of states. 

Including better controls dramatically changes the 
results. Most importantly for our purposes, the welfare 
variable is now positive and significant, as predicted by 
the welfare migration Pains One discordant note 
is the extremely high R*+. This occurs because the both 
the dependent variable and the nonwelfare migration 
variable are close to one for the 48 own-state pairs and 
close to zero for the 2,256 other pairs. Therefore the 
next two columns look only at interstate move dyads 
by excluding own-state pairs. Again, the sparsely spec- 
ified model performs abysmally and the model with 
improved controls performs much better. Here again, 
welfare benefits are positively and significantly associ- 
ated with migration. 

Other individual-level variables measure systematic 
determinants of individual-specific attraction to or re- 
pulsion from certain states. An excellent proxy for 
family ties and social capital is the birth state of an 
individual. The variable birth state is one if person i was 
born in state s and zero otherwise. Simply put, this vari- 
able controls for the possibility that—all else equal—a 
person born in Mississippi derives greater utility from 
living in Mississippi than someone born in Vermont. I 
also interact this variable with an indicator for individ- 
uals in the welfare population in case birth state effects 
differ for the welfare and nonwelfare populations. 





Conditional Logit Results 


Tables 2 through 4 present the results for the more 
compelling tests based on the individual-level model 
of state choice. I estimate but do not report state fixed 
effects for all specifications. 

Table 2 indicates that welfare benefits exert a pos- 
itive and highly significant effect on migration. The 
first specification includes only the distance, move, and 
welfare variables. The second specification adds birth 
state variables. I proceed in this fashion in order to 
highlight how omitting birth state effects attenuates 
the estimated effect of welfare on migration. Note that 
including the birth state variables causes the coeffi- 
cient on welfare benefits to almost double. The welfare 
benefits variable is significant—and hardly changed— 
in the third specification, which adds wage and unem- 
ployment interactions for the welfare population. In 
the last two specifications, I assess whether welfare 
benefit levels interact with birth state and distance. 
The results indicate that both interactions matter, but 
that neither substantially changes the results. Col- 
umn 4 reports the results when birth state and benefits 
variables are allowed to interact. The coefficient on 
welfare is higher than in the other specifications for 
non-birth states, while the coefficient on welfare for 
birth states (the sum of the main effect and the interac- 
tion) is essentially zero. This result implies that welfare 
and family effects are substitutes, not complements. 
Column 5 reports the results when distance and bene- 
fits interact. This tests whether the magnetic effect of 
welfare diminishes across space. The results indicate 
that this is indeed the case, as the interaction is signi- 
ficant. 

The other noteworthy result in Table 2 is the over- 
whelming statistical significance of the birth state vari- 
ables, which consistently has a t-statistic over 90. The 
interaction with the indicator variable for poor single 
mothers is also significant, indicating a stronger birth- 
state attraction for poor single mothers relative to the 
control group of women without children. Even taking 
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TABLE 2. Conditional Logit Model of Migration Cholce with Poor Single Women without Children 
as Control Group 


Welfare benefits » poor single mother 
Log distance 

Log distance * poor single mother 

Move 

Move » poor single mother 

Birth state 

Birth state * poor single mother 

Retall wage * poor single mother 
Unemployment » poor single mother 
Welfare » birth state « poor single mother 


Welfare benefits « poor single 
mother x distance 
Observations 
Treatment group 
Control group 
Pseudo-A* 


110,243 
69,270 
0.844 


(2 (3 (4) 

0.13 0.13" 
(10.17) 
0.57" 
(37.34) 
~0.01 

(0.63) 
~1,28—" 
(12.58) 

0.14 

(1.06) 

2,20% 
(94.74) 

0.28" 

(9.72) 


110,243 
69,270 
0.863 


110,243 
69,270 
0.863 


110,243 
69,270 
0.863 


110,243 
69,270 
0.863 


Note. Figures are coefficients from a conditional logit estimation. Vanablee that are the same for all Individuals within each state (e g, 
unemployment or average temperature) are controlled for with fxed effects for states (not reported); see text for detalls #statstics are 


In parentheses "p < 001, “™*p < 0.001. 


into account the massive sample size, there can be no 
doubt that birth state attractiveness matters. 

The control variables perform as expected. The dis- 
tance variable is negative and significant, implying that 
the farther away a state is, the less likely an individual is 
to move there; the effect is the same for both the treat- 
ment and the control groups in columns 2 through 4. 
The move variables indicate a clear fixed cost to moving 
across state borders that is the same for control and 
treatment populations. Wages are less magnetic and 
unemployment is less repelling to individuals in the 
welfare population. Given the availability of govern- 
ment support for these individuals, it is not surprising 
that market factors play less of a role in migration for 
them. 

Table 3 explores the robustness of the results by 
presenting results for alternative specifications of the 
model. In columns 2 and 3, the control group is sin- 
gle men without children; in columns 4 and 5, it is 
middle-income married women with children. ‘These 
specifications lead to rejection of the null hypothesis 
that welfare benefits exert no effect as long as birth 
state is taken into account. That is, in the specifications 
that include birth state and other controls (the third and 
fifth columns), welfare benefits are positive and statisti- 
cally significant. In the specifications that do not include 
birth state, the coefficient on welfare benefits is 30% 
smaller (when single men are the control group) or 
negative and borderline statistically significant (when 
married women are the control group). This again in- 
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dicates that failure to account for family ties in birth 
states can attenuate or even reverse estimated welfare 
effects. 

Again, the null hypothesis of no birth state effects is 
overwhelmingly rejected. Birth states exert a stronger 
attraction on poor single mothers, especially when the 
control group is middle-income married mothers. It 
appears that the family and employment concerns of 
married women’s husbands reduce the relative likeli- 
hood that these women will move to their birth states. 

The results also indicate that wages exert a smaller 
effect on the welfare population than on the single male 
control group. There does not appear to be a difference 
in the effect of wages relative to married women, sug- 
gesting that married women with higher incomes and, 
in many cases, working husbands do not experience 
much of a wage effect. Unemployment repels poor 
single mothers less than either control group, again 
consistent with the idea that the welfare population is 
more likely to have nonmarket means of support. 

The final column in Table 3 presents results for a 
conditional logit model estimated on a sample con- 
fined to those who did not reside in the same state 
in 1985 and 1990. Limiting the sample in this fashion 
addresses two concerns. First, it is possible that people 
make different calculations when deciding whether to 
move than when deciding where to move once they 
have decided to make an interstate move. Second, it is 
possible that cultural or behavioral differences across 
states may lead some states to have both high welfare 
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Log distance » poor single mother 
Move 
Move x poor single mother 
Birth state 
Birth state » poor single mother 
Retail wage * poor single mother 
Unemployment » poor single mother 


Observations 





Treatment group 110,243 
Control group 96,684 
Pseudo-A* 0.844 


In parentheses. *p < 0.05; "p < 0.001. 


benefits and, say, high rates of out-of-wedlock births. 
This could mean that states with high benefits may have 
a higher proportion of single mothers relative to the 
control group than other states even if welfare did not 
promote or deter migration. J exclude the move vari- 
ables from the analysis because they are not identified 
for a sample in which everyone moved. Despite the 
massive reduction in the sample size, the last column in 
Table 3 indicates that welfare and birth state variables 
remain highly significant. 

Table 4 includes race in the analysis by providing 
separate results for whites and African Americans. 
This controls for the possibility that state attractiveness 
varies by race and tests whether the marginal effects 
of welfare and other factors vary by race. The results 
indicate that race matters in some ways but not in oth- 
ers. Although state-level attractiveness varies across 
races, the marginal effects of the welfare, birth state, 
and other variables are similar for both races. The wel- 
fare variable is positive and significant even without the 
birth state variable for both races. What is happening 
here is that exclusion of both birth state and racial fac- 
tors severely attenuates the estimated effect of welfare 
on migration. Including one or both of these factors 
makes the effect more clearly visible. When both are 
accounted for (as in the second column for each race), 
the effect of welfare is most clearly apparent. 

The results indicate the existence of both birth state 
and welfare effects. But how large are these effects? To 
illustrate the magnitude of the birth state effect, I use 
specification (3) from Table 2 to simulate the probabil- 


TABLE 3. Alternative Specifications of the Conditional Logit Model of Migration Cholce 


Control Group 
Movers 
Single Men Married Women Only 
Welfare benefits + poor single mother 0.07™ 0.10 —0.02* 0.10" 0.19°" 
(5.70) (9.11) (1.80) (9.00) (10.32) 
Log distance —0.60°" —0.58°" —1.60™ 


110,243 110,243 110,243 9,841 
96,684 122,681 122,681 5,870 
0 863 0.854 0 869 0.928 


Note: Figures are coefficients from a condrtonal logit estrmation. Varlables that are the same for all indrviduals within each state (8.g., 
unemployment and average temperature) are controlled for with ftxed effects for states (not reported); see text for details. tstatistics are 

















ity that an individual moves from state j to state kwhen 
state kis or is not the individual’s birth state. To con- 
serve space, I discuss a representative example rather 
than provide a comprehensive tally of the simulations. 

Consider a poor single mother living in Ilinois in 
1985. If she was born outside the continental United 
States, the model predicts that she would have a 93.7% 
probability of living in Ilinois in 1990, a 0.11% proba- 
bility of living in Alabama in 1990, and a 0.37% prob- 
ability of living in Florida in 1990. If she had been 
born in Illinois, however, her probability of staying in 
Ilinois through 1990 would rise to 99.4%; if she had 
been born in Alabama, her predicted probability of 
moving from [Illinois to Alabama by 1990 would be 
1.3%. If she had been born in Florida, her predicted 
probability of moving from Illinois to Florida by 1990 
would be 4.3%. In general, women would be about 10 
times more likely to move to a given state if they had 
been born there than if they had been born outside of 
the continental United States. 

Birth state effects on migration not only are large 
in an absolute sense, but also are large relative to 
the effect of welfare on migration. In order to pro- 
duce a change in the probability of living in Alabama 
similar to that produced by changing a 1985 Ilinois 
resident’s birth state from the noncontinental United 
States to Alabama, Alabama would have to increase 
its average welfare spending by the implausibly large 
amount of $18,000 in 1983 adjusted dollars (its actual 
average spending from 1985 to 1990 in 1983 housing 
cost—adjusted dollars was about $4,500). 
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TABLE 4. Conditional Logit Model of Migration Cholce by Race 
Whites 


Welfare benefits » poor single mother 
Log distance 

Log distance » poor single mother 
Move 

Move » poor single mother 


Birth state 


Birth state * poor single mother 


Retall wage * poor single mother 
Unemployment * poor single mother 


Observations 
Treatment group 
Control group 

Pseudo-A* 


58,799 
49,696 
0.820 


African Americans 


Note: Figures are coefficients from a conditonal logit esimation Vanabies that are the same for all Indhi 
unemployment and average temperature) are controlled for with fixed effects for states (not reported), see text for details. Fetatsbhcs 


are ın parentheses. *p < 001, “p <0 001. 


That birth state effects are so large does not mean 
that the welfare effects are negligible. Table 5 uses the 
results from Table 4 to simulate the effects on mi- 
gration and spending of changes in welfare benefits 
For each state, I calculate the welfare population by 
computing the predicted probability individuals from 
all states move to (or remain in) that state. For each 
state, I then increase its welfare benefits (holding all 
other states constant) and calculate the net change 
in expected population. I look at two levels of in- 
creases. A 10% increase represents the average leg- 
islated increase in welfare benefits during the time pe- 
riod covered. The national standard deviation ($1,086 
per year in real, state-adjusted 1982—84 dollars) rep- 
resents a more meaningful change. Under AFDC such 
large changes were rare, but Temporary Assistance for 
Needy Families (“TANF”) TANF has produced very 
substantial changes across states (although the changes 
are harder to quantify in terms of benefit levels as the 
changes include changes in eligibility, sanctions, and 
other nonbenefit aspects of aid). 

Columns 2 and 3 in Table 5 simulate the effect 
on migration of an ordinary increase in benefits. Be- 
cause the simulated increase is in percentage terms, 
the simulated effects are lower for low spending states. 
The effects are modest but nontrivial. Alabama (which 
had a low level of benefits, meaning that a 10% in- 
crease would be absolutely small) would see a net 
inflow of 688 families headed by poor single moth- 
ers; Wisconsin (which had more generous benefits, 
meaning that a 10% increase would be absolutely 
large) would see an increase of 1,320 poor single 
mothers. 
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The columns on the right simulate what would hap- 
pen if a state increased its benefits by one standard 
deviation of all states while all other states maintained 
their benefit levels. The results are, not surprisingly, 
larger. For example, if Alabama increased its benefits 
by this amount, it would have 1,878 more poor sin- 
gle mothers. If Wisconsin increased its benefits by this 
amount, it would have 2,039 more poor single mothers. 
Because TANF has supplanted AFDC, these data are 
not to be taken as predictive; rather, they character- 
ize the magnitude of the effect of more substantial 
changes in welfare policy in terms of a well-studied 
program. 

Table 5 also provides an estimate of the annual 
amount spent on AFDC for the net in-migrants, cal- 
culated as follows. First, I estimated the eligibility per- 
centage for each state as the actual (unsimulated) num- 
ber of households receiving welfare divided by the ac- 
tual number of low-income single mothers. Second, 
I multiplied the expenditures per welfare household 
(from the Social Security Bulletin Annual Statistical 
Supplement for 1990 [Social Security Administration, 
various years]) times the net change in poor single 
mothers times the average eligibility of such individu- 
als. This amount is the approximate cost per year real- 
ized after changing policy and having five years’ worth 
of migration (since the estimated migration effects are 
based on data spanning five years). 


6 To produce conservative cost simulations, I set the eligibility pro- 
portion at one for states that have more households receiving welfare 
than low-income single mothers 
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TABLE 5. Simulated Effect of Welfare on Migration and Spending 


Welfare Increased by 10% of Own 


Welfare Increased by Natonal 


State Spending Standard Deviation 
Net Change Annual Migraton- Net change Annual Migration- 
of Poor Induced Change In of Poor Induced Change In 
Single Percentage AFDC Spending Single Percentage AFDC Spending 
Mothers Change (in 2000 Dollars) Mothers change (In 2000 Dollars) 

Alabama 688 0.83% 1,420,609 1,660 2.00% $3,430,833 
Arizona 748 1.52% $4,130,727 1,464 2.98% $8,087,585 
Arkansas 724 1.59% $2,418,703 1,399 3.08% $4,673,836 
Calfornla 2,666 0.83% $35,031,702 3,782 1.18% $49,689,734 
Colorado 892 2.13% $5,893,835 1,469 3.51% $9,712,292 
Connecticut 772 321% $9,097,960 1,262 5 24% $14,868,845 
Delaware 244 3.56% $1,470,342 468 6.82% $2,817,895 
Florida 1,724 1.00% $9,346,809 3,630 2.11% $19,686,960 
Georgla 1,191 0.87% $6,507,216 2,373 1.74% $12,968,386 
Idaho 372 3.30% $1,656,929 612 5.42% $2,723,995 
Ilinols 1,838 1.20% $12,981,360 3,322 2.17% $23,465,472 
Indiana 1,243 1.77% $6,751,476 2,204 3.13% $11,973,241 
lowa 716 2.36% $5,476,152 1,036 3.42% $7,924,389 
Kansas 795 2.80% $5,446,604 1,205 4.25% $8,255,996 
Kentucky 857 1.42% $3,960,985 1,638 2.71% $7,574,044 
Loulsiana 878 0.69% $3,032,914 1,763 1.39% $6,091,852 
Malne 424 2.74% $3,693,688 651 4.21% $5,687,848 
Maryland 931 1.85% $7,107,384 1,684 3.35% $12,852,611 
Massachusetts 1,290 2.03% $14,810,181 1,929 3.04% $22,146,686 
Michigan 2,058 1.36% $19,724,044 2,913 1.93% $27,916,721 
Minnesota 859 2.07% $9,072,960 1,177 2.85% $12,440,938 
Mississippl 690 0.76% $1,710,962 1,649 1.81% $4,089,303 
Missour 1,140 1.56% $6,452,632 2,068 2.83% $11,691,701 
Montana 366 3.00% $2,603,514 527 4.32% $3,741,044 
Nebraska 485 2.93% $3,357,771 751 4.54% $5,202,919 
Nevada 418 3.20% $2,397,503 847 6.47% $4,851,808 
New Hampshire 309 4.55% $2,749,967 529 7.78% $4,705,108 
New Jersey 1,307 2.03% $9,493,061 2,383 3.69% $17,305,984 
New Mexico 570 1.79% $3,181,471 1,038 3.25% $5,788,680 
New York 3,335 1.36% $38,102,980 4,768 1.95% $54,475,436 
North Carolina 1,243 1.09% $6,090,581 2,394 2.09% $11,730,348 
North Dakota 214 3.17% $1,585,323 291 4.32% $2,162,826 
Ohio 1,880 1.16% $12,710,548 3,115 1 92% $21,058,295 
Oklahoma 1,008 1.97% $5,803,655 1,609 3.15% $9,262,292 
Oregon 818 2.24% $6,318,966 1,271 3.48% $9,816,173 
Pennsytvania 1,596 1.20% $12,584,212 2,529 1.90% $19,941,384 
Rhode Island 458 3.99% $1,371,194 669 5.83% $2,005,138 
South Carollna 745 1.04% $2,696,859 1,490 2.08% $5,391,737 
South Dakota 240 2.76% $229,244 353 4.07% 7,968 
Tennessee 939 1.08% $5,266,028 2,132 2.46% $11,957,675 
Texas 1,690 0.60% $6,891,791 3,754 1.33% $15,312,704 
Utah 491 2.76% $253,406 758 4.26% $391,456 
Vermont 314 5.09% $1,752,887 408 6.62% $2,277,823 
Virginia 995 1.32% $5,435,119 2,028 2 69% $11,068,208 
Washington 1,148 1.94% $10,698,691 1,650 2.79% $15,382,629 
West Virginla 646 2.08% $3,326,057 1,048 3.38% $5,393,940 
Wisconsin 1,320 2.58% $12,646,313 1,708 3.33% $16,357,411 

min 265 4.24% $1,713,030 408 6.51% $2,627,578 


Note Simulation based on specrhcation 3 In Table 2 and assumes that other states do not change policies. See text for detalls 


These financial simulations understate actual costs 
in several respects. First, costs would continue indefi- 
nitely, accruing every year. Second, the costs would rise 
as in-migration continued over time. Third, the amount 
does not include other expenses associated with the 


in-migration of poor single mothers. These expenses 
may be large. For example, in 1990, almost four times 
as much was spent on Medicaid as on AFDC. Other 
social services and education also involve substantial 
sums. Fourth, benefit increases would also have direct 
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costs due to increased spending on in-state residents, 
some of whom might be more likely to seek welfare 
benefits if benefits were increased. 

Even for these narrowly defined costs, Table 5 in- 
dicates a nontrivial financial effect of welfare migra- 
tion. For example, five years after increasing benefits 
by one standard deviation, Alabama would be pre- 
dicted to spend $3.8 million more per year on the net 
in-migration alone. Even increasing benefits by 10% 
would cost Alabama $1.1 million a year on the in- 
migrants. Cutting costs by these amounts would yield 
commensurate savings. Although these estimates are 
modest relative to overall state budgets, they demon- 
strate how welfare-induced migration can make raising 
benefits more problematic and cutting benefits more 
tempting for many states. 


CONCLUSION 


The analysis presented here is built on three premises. 
First, we should be wary of existing research that is 
based on inadequately specified models. Second, state- 
level fixed effects, combined with a quasi-experimental 
research design, can effectively control for state at- 
tributes that affect migration. Third, a discrete-choice 
conditional logit model can effectively control for im- 
portant individual determinants of migration such as 
family and race. 

The results validate prior indications that family 
ties fundamentally influence residential choice. In all 
specifications, birth state effects are huge. These ef- 
fects dwarf welfare effects, supporting Schram, Nitz, 
and Krueger’s (1998, 228) rejection of narratives 
in which poor single mothers narrow-mindedly and 
illegitimately—if economically rationally—fiock to 
states with higher benefits. These results imply that 
poor single mothers are as likely—or even more 
likely—to move home as everyone else. But more im- 
portantly for the vast literature on welfare and mi- 
gration, the results also indicate that welfare benefits 
exert a nontrivial effect on state residential choice. ‘This 
finding stands in contrast to much previous work. 

The results do not imply that welfare-induced mi- 
gration will fundamentally remake the demographic 
profile of the country. However, the welfare migration 
hypothesis does not require welfare to exert a dom- 
inant effect, only a real effect. And here, the results 
provide strong, robust indications that the effect is real. 
For example, simulations indicate that if California in- 
creased its benefits by a standard deviation, it would 
have a net inflow of about 4,289 households headed by 
poor single mothers after five years. These additional 
households would add approximately $56 million per 
year indefinitely to AFDC costs, to say nothing of ad- 
ditional costs associated with Medicaid, housing, and 
other services. Such effects are modest relative to state 
populations and budgets; whether they are modest with 
regard to the politics of state policymaking is an open 
question. 

The results have important policy implications. They 
do not provide guidance whether redistributive social 
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spending is desirable or effective; instead, the results 
imply that migration may discourage states from pro- 
viding high welfare benefits because such generosity 
attracts and retains potential welfare recipients. There- 
fore, if redistributive social spending is desirable, then 
policymakers need to create institutional structures 
that can support it. They could do this, for example, 
with federal matching programs designed to offset for 
states the costs associated with welfare-induced mi- 
gration (see Gramlich 1985 and Inman and Rubinfeld 
1997, 58). The recent move toward funding welfare 
with block grants, on the other hand, may increase 
incentives for politicians to cut funding and may fur- 
ther undermine the ability of states to formulate social 
policies independently. 

The results also have implications for broader 
debates about governmental capacities for redistribu- 
tive social spending. Migration effects can be real but, 
nonetheless, obscured by the complexity of model- 
ing reality. This implies that scholars should continue 
to cast a critical eye on empirical results in related 
areas such as tax and regulation related migration. 
Substantively, the findings validate the idea that mo- 
bility constrains governmental activities in at least 
one important context. To say that such constraints 
exist, however, is not to say that they are neces- 
sarily problematic. After all, these constraints could 
constitute either an undue hindrance on democratic 
autonomy or a useful discipline against excessive 
redistribution. Therefore, scholars should also con- 
tinue the arduous task of normatively and theoreti- 
cally investigating the proper role of government and 
the institutional mechanisms for enabling government 
to play that role in an increasingly integrated—and 
mobile—world. 
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Rousseau’s Anti-Agenda-Setiting Agenda and Contemporary 


Democratic Theory 


JOHN T. SCOTT University of California, Davis 


n his recent article, “Rousseau on Agenda-Setting and Majority Rule” (2003), Ethan Putterman 

examines how the democratic principle of popular majority rule might be reconciled with agenda- 

setting by legislative experts through an analysis of Rousseau’s political theory. He argues that 
Rousseau accomplishes this reconciliation through a novel separation of powers between the legislative 
and the executive powers where the sovereign people delegates the exclusive power to initiate laws to 
the executive. Putterman thereby identifies as a solution to the problem of democratic self-legislation 
what Rousseau sees as the most important danger to it. At issue is not merely the correct interpretation of 
Rousseau’s theory, for Putterman’s argument raises far-reaching questions concerning the compatibility of 
democratic principles and institutions. After demonstrating that Putterman is incorrect that the sovereign 
people in Rousseau’s state delegate the power of legislative initiative, I examine how Rousseau anticipates 
and addresses a related question central to contemporary democratic and social choice theory: the 
problem of preference aggregation through voting in the absence of agenda-setting institutions. 


ousseau’s political thought remains a central 
R ee point for democratic theory. Partic- 

ipatory democrats such as Barber (1984) and 
Pateman (1970) appeal to Rousseau’s ideas about 
democratic legitimacy and the supposed transforma- 
tive effects of participation, arguing that representa- 
tive and liberal political institutions that determine 
the legislative agenda undermine democratic self- 
government. Critics of strong democracy, in turn, target 
Rousseau as the primary exponent of a theoreti- 
cally and practically untenable populism. Within so- 
cial choice theory, for example, Riker (1982) famously 
argues that Rousseau’s theory of the “general will” 
falls prey to paradoxes of voting that make trying to 
identify the common good through voting meaningless, 
and suggests that agenda-setting is a pervasive feature 
of political and social organizations. This debate over 
democratic legitimacy and institutions touches on a 
broad spectrum of issues in contemporary democratic 
theory and political science more generally. 

In his recent article “Rousseau on Agenda-Setting 
and Majority Rule,” Ethan Putterman (2003) exam- 
ines through an analysis of Rousseau’s political theory 
how the democratic principle of popular majority rule 
might be reconciled with agenda-setting by legislative 
experts. Putterman argues that Rousseau accomplishes 
this reconciliation through a novel separation of pow- 
ers between the legislative and the executive powers 
where the sovereign people delegates the exclusive 
power to initiate laws to the executive, or “govern- 
ment.” 

Putterman’s attention to the institutional features 
and actual operation of the state outlined in The Social 
Contract is a welcome departure from most interpre- 
tations. Too often scholars view Rousseau’s treatise on 
the “principles of political right” as mere principles, yet 
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fully half of the work is devoted to the art of govern- 
ment. Rousseau’s theory of the separation of powers is 
“perhaps the most original aspect of his political teach- 
ing” (Masters 1968, 335), but it is widely overlooked or 
misunderstood. Participatory democrats, for example, 
tend to ignore the fact that he divides executive and 
legislative power. Indeed, they usually fail to mention 
that if Rousseau famously insists on democratic self- 
legislation, he argues just as emphatically against demo- 
cratic self-government. If the participatory interpreta- 
tion overlooks the separation of powers, so too does the 
opposed reading that sees Rousseau’s sovereign people 
as inactive or “immobile.” This reading is typified by 
Shklar (1969), who argues that “the sovereign does very 
little” (181). For Shklar, the institutions of Rousseau’s 
state are designed “to prevent all change,” and she 
therefore almost entirely ignores the separation of 
powers and only briefly discusses the government, char- 
acterizing it as the “active agent of destruction” of the 
state (209-11). Finally, interpreters who do take note of 
Rousseau’s separation of powers often argue that this 
delegation of power seriously undermines democratic 
sovereignty (Cullen 1993; Fralin 1978; Johnston 1999), 
Putterman steps into this debate to analyze the consis- 
tency between Rousseau’s political principles and the 
Institutions he recommends, bringing Rousseau into 
contemporary debates over democratic principles and 
political institutions in a potentially fruitful way. 
Despite the virtues of his approach, however, 
Putterman’s interpretation is based on a misunder- 
standing of Rousseau’s theory of the separation of 
powers. “As brief as Rousseau is on this point,” 
Putterman (2003) claims, “he is consistent that a body 
other than the sovereign must initiate the law” (467). 
Rather than letting the sovereign both initiate and 
ratify the law, he claims, Rousseau chooses “to sub- 
divide the legislative power between representatives 
and the majority” (466). In arguing that Rousseau del- 
egates the exclusive power to initiate law to the govern- 
ment, however, Putterman identifies as a solution to the 
problem of democratic self-legislation what Rousseau 
sees as the greatest danger to it: the transfer of 
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agenda-setting power to the government. Rousseau 
does not argue that the legislative power must be di- 
vided or subdivided, and he stresses throughout his 
work that the government must be restricted to exe- 
cuting the law. 

At issue here is not merely the correct interpreta- 
tion of Rousseau’s political theory, for Putterman’s ar- 
gument raises far-reaching questions concerning the 
compatibility of democratic principles and institu- 
tions. In this article, I analyze Rousseau’s theory 
of the separation of powers and demonstrate that 
Putterman is incorrect in his claim that Rousseau in- 
vests the executive with the agenda-setting power of 
legislative initiative. Throughout I follow Putterman in 
using the term “agenda-setting” to refer to the power of 
legislative initiative, although this power is more prop- 
erly a form of “gatekeeping,” which is merely one form 
of “agenda-setting.” If Rousseau rejects such agenda- 
setting power by the executive as undemocratic, his 
theory may fall prey to a related issue central to con- 
temporary democratic and social choice theory: the 
problem of preference aggregation through voting in 
the absence of agenda-setting institutions. I therefore 
conclude by examining how Rousseau anticipates and 
addresses the paradox of voting. Overall, my aim is 
to show that Rousseau’s treatment of both agenda- 
setting and voting throws light on issues of principles 
and institutions still at the center of democratic theory 
and practice. 


ROUSSEAU’S ANTE-AGENDA-SETTING 
AGENDA 


—I warn the reader that this chapter 
should be read carefully, and that I do 
not know the art of beng clear for those 
who are not willing to be attentive. 


Jean-Jacques Rousseau (The Social 
Contract [SC] 1.1) 
Rousseau includes the above admonition to the poten- 
tially inattentive reader at the head of his discussion 
of government in Book IMI of The Social Contract. The 
warning is unique to the treatise and is meant to expose 
a particularly pernicious error in politics. Up to this 
point in the work, Rousseau has insisted that law can 
legitimately come only from the sovereign people, but 
now he is about to argue that the power to execute 
the law should be delegated to a separate body, the 
“government.” This delegation of power nonetheless 
poses a threat. Rousseau’s warning is animated by his 
fear that the people will lose their sovereign authority, 
either when the executive illegitimately assumes the 
legislative power or when the citizens mistakenly alien- 
ate their right to representatives. One of Rousseau’s 
primary purposes in Book III of his treatise is, there- 
fore, to alert the people to the danger inherent from 
the government and to suggest ways in which that body 
can be kept within its sphere of executing the law rather 
than making it. 
Given Rousseau’s insistence on the separation be- 
tween legislative and executive powers, Putterman 
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(2003) asks the right question in response to the overly 
democratic reading of Rousseau that would invest both 
the legislative and the executive powers in the peo- 
ple: “Why is a separation of powers among lawmakers 
even necessary?” (460-61). Putterman is correct that 
Rousseau is dividing political power, but the answer he 
provides to the question is based on a misunderstand- 
ing of the separation of powers and Rousseau’s reasons 


for arguing that it is necessary. 


The Separation of Legislative 
and Executive Power 


In the chapter to which he attaches his warning to 
the inattentive reader, Rousseau distinguishes between 
the legislative and the executive powers. The relation- 
ship between the sovereign people and the govern- 
ment has proved “unfathomable in politics,” according 
to Rousseau: “Here, all legislators have gone astray” 
(Rousseau 1978, 168; see also Letters Written from the 
Mountain, Letter 8, Rousseau 1990-, 9:257). His delin- 
eation of the legislative and executive powers requires 
equally clear and distinct terminology. In explaining his 
novel theory of the separation of powers he nonethe- 
less chooses to use familiar terms, giving old words 
new meanings. His warning at the head of the chapter 
therefore alerts the reader to pay attention to usage. 
Not only do Rousseau’s terms often not correspond 
to our usage today, posing an obstacle to proper un- 
derstanding of his work, but also they are sometimes 
at odds with the usage of his own time. Such is the 
case for the general term he uses to name the execu- 
tive power: “government.” The term “government” in 
Rousseau’s time, as often in ours, includes the power to 
make law. Indeed, the sovereign legislative power is the 
main attribute of “government” in this understanding. 
This is the case even for Montesquieu, the theorist who 
most influenced Rousseau’s theory of the separation of 
powers (see Montesquieu [1748] 1951, II.1). Rousseau 
complains, however, that the government “has been 
incorrectly confounded with the sovereign, of which it 
is only the minister” (SC IB.1). His theory requires a 
radical separation of powers that restricts the govern- 
ment to the execution of the law. 

The primary purpose of the separation of powers in 
Rousseau’s theory is to maintain the generality of the 
laws from the particularity necessary in their execution 
by delegating the executive power to a distinct body. 
“We have seen that the legislative power belongs to the 
people and can belong only to it,” he explains at the 
beginning of his discussion of government (SC U1; 
see II.2). Since the laws emanating from the general 
will of the people must be general in their form as 
well as in their source (IL.6), the power to execute 
the laws cannot belong to the people in its capacity as 
sovereign (III.1). A separate body is needed to execute 
the laws and give movement to the body politic. This is 
the “government.” Rousseau gives several alternative 
names to the body and its members: “The members of 
this body are called magistrates or kings, that is, to say 
governors; and the body as a whole bears the name of 
prince” (II.1). His terms amount to putting entirely 
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new wine into old bottles, and he thereby intimates that 
kings and princes have hitherto misunderstood—or 
have abused—their proper function. 

Rousseau consistently emphasizes the limited func- 
tions and subordinate nature of the government. The 
government is “an intermediate body established be- 
tween the subjects and the sovereign for their mutual 
communication and charged with the execution of the 
laws and the maintenance of civil as well as political 
freedom” (SC II.1). The government is “only the min- 
ister” of the sovereign. “It is absolutely nothing but a 
commission, a function in which, as simple officers of 
the sovereign, they exercise in its name the power that 
bas been entrusted to them by the sovereign, and that 
the sovereign can limit, modify, and take back when- 
ever it pleases” (IIL1). As he explains earlier in the 
work when discussing the fact that the law much come 
from the sovereign people, “the government must not 
be confounded [se confonde] with the sovereign, but 
must be its minister” (II.6, n.). The government and the 
sovereign must not “become merged” (se confondre) 
when it comes to the making of law: “The people that 
is subject to the laws ought to be their author” (11.6). In 
bis more detailed discussion of the role of government 
in the essay on “Political Economy,” where he also 
stresses the need to distinguish the sovereign and the 
government, Rousseau (1978, 211) characterizes the 
government as attending to the execution of the laws 
and to “administrative and economic details” under 
the guidance of the general will expressed through law 
(216). Nowhere in his treatments of government does 
he suggest that it shares the legislative power with the 
people. 

Rousseau’s discussion of the different forms of gov- 
ernment revolves around the question of which form 
will best maintain the separation of legislative and ex- 
ecutive powers. For example, in his rejection of democ- 
racy as a form of government he emphasizes the main 
reason for separating the legislative and executive pow- 
ers: investing the same body with both powers would 
lead inevitably to the corruption of the general will by 
the temptation of particular considerations. “If there 
were a people of Gods, it would govern itself demo- 
cratically. Such a perfect government is not suited to 
men” (SC I.4). His discussion of monarchy likewise 
points to the frailty of human nature as a danger, in this 
case the danger that the king will exceed the bounds 
of his executive function. “Kings want to be abso- 
lute,” Rousseau declares, and he subtly underscores 
the threat from those who have considered themselves 
“sovereign” by avoiding the term in the chapter on 
monarchy (1.6; Gildin 1983, 114). Finally, in recom- 
mending elective aristocracy as his preferred form of 
government, Rousseau stresses not only that it is most 
liable to lead to government by those with probity, 
enlightenment, and experience, but also that it “has 
the advantage of distinguishing between the two pow- 
ers,” legislative and executive, that he wants to keep 
separate (SC I.5). 

If the first half of Book IN of The Social Contract 
is devoted to defining “government,” delineating its 
limited functions, and examining the forms it might 
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take, the second half of the book is concerned with 
preventing the usurpation of the sovereign power by 
the government (see Gildin 1983, 127). Rousseau views 
the expansion of government power, and its concentra- 
tion in an ever smaller body, as the main cause of the 
decline and death of the body politic. The dissolution 
of the state, he explains, comes about “when the prince 
no longer administers the State in accordance with the 
laws and usurps the sovereign power” (SC 0.10). He is 
similarly concerned that a people fallen away from vir- 
tuous concern with the public good will cede its power 
to legislate to representatives, an illegitimate alienation 
of authority. “Sovereignty cannot be represented for 
the same reason it cannot e alienated. It consists es- 
sentially in the general will, and the will cannot be rep- 
resented” (1.15). In order to ensure that the people 
preserves its sovereign power, Rousseau argues that it 
must assemble as often as possible (1112-14). When 
the people assembles, the power of the government 
ceases (III.14), and the assembly is opened by asking 
whether the sovereign wishes to maintain the same 
form of government and whether it wishes to leave the 
administration in the hands of those currently respon- 
sible for it (111.18). This argument about the cessation 
of the executive power alone provides persuasive ev- 
idence that the deliberations of the sovereign people 
are not limited by any agenda set beforehand by the 
government. Indeed, like the decision to maintain or 
abolish the present form of government, the institution 
of government is itself a law that cannot have been 
put on the agenda by the body it creates (01.17). As 
Rousseau (1990—) explains in a fragment, “If laws exist 
before government, they are therefore independent of 
it; the government itself depends on laws since it is from 
them alone that it derives its authority; and far from 
being their author or master, it is only their guarantor, 
administrator, and at most, interpreter” (4:28). 


A Critique of Putterman’s Interpretation 


Putterman claims throughout his article that Rousseau 
invests the agenda-setting power of legislative initiative 
with the government, but he acknowledges that the ev- 
idence is sparse. He is surprised that “Rousseau offers 
just two brief cryptic comments on the positive features 
of legislative representation in his mature state—and 
both within the context of rejecting the sovereignty 
of representatives.” He conjectures that Rousseau did 
not elaborate on this purported agenda-setting power 
because he wanted to obscure the significance of the 
government’s power (Putterman 2003, 461-62; see also 
460, n.2). Examination of the evidence Putterman cites 
for his claim reveals not only that the government in 
Rousseau’s political system does not legitimately have 
the power to initiate laws and set the legislative agenda, 
but that the desire of the government to possess that 
power poses a constant threat to the people’s legislative 
authority. 

The first passage Putterman adduces for his claim 
that the government possesses agenda-setting power 
comes from the chapter of The Social Contract “On 
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Deputies or Representatives” (IJ.15). Rousseau ex- 
plains there: 


Sovereignty cannot be represented for the same reason 
it cannot be alienated. It consists entirely in the general 
will, and the will cannot be represented. Either it is itself 
or it is something else; there 1s no middle ground. The 
deputies of the people, therefore, are not nor can they be its 
representatives; they are merely its agents [commissaires]. 
They cannot conclude anything definitively. 


This passage is an inauspicious source for wisdom on 
the power and role of the government since Rousseau 
is not even discussing the government’s (proper) func- 
tion in this context. Rousseau’s argument in the chapter 
under analysis is that the existence of representatives 
or deputies is a sign of a degenerated state. His spe- 
cific point in the passage is that if the citizens decide 
to “name deputies and stay home,” their supposed 
representatives should be considered as mere agents 
(commissaires) who make proposals for the people to 
consider in its capacity as the legislative authority. He is 
pot describing the legitimate role of the government in 
his own system, for he is decisively not recommending 
representation in any form. His intention is rather to 
warn against an improper understanding of delegated 
legislative power, and his argument is at best a sugges- 
tion of how to make the best of a bad situation. 

Even assuming that Rousseau was describing the 
role of the government in this passage, the infer- 
ences Putterman draws from it go well beyond what 
is actually written. Putterman’s positive claim that the 
government possesses agenda-setting power is based 
on Rousseau’s negative statement that the “deputies” 
or “agents” of the people “cannot conclude anything 
definitively.” Putterman seems to assume that if these 
commissaires “cannot conclude anything definitively,” 
they must be proposing something. That is, they must 
propose laws. Furthermore, rather than arguing that 
the government can propose laws for the people’s con- 
sideration, which is a strained but plausible interpre- 
tation of this passage under the present assumption, 
Putterman claims that only the government may initi- 
ate laws. To be sure, Putterman (2003) argues that the 
government may not actually make the laws: “Laws 
proposed by commissaires are illegitimate and the ex- 
pression of an alien will unless and until each is met with 
the express approval of the majority” (464). While it 
is true that Rousseau insists that the sovereign peo- 
ple alone can make law, he does not positively argue 
that commussaires, magistrates, or anyone else has the 
power to propose laws. 

Finally, since I have emphasized Rousseau’s insis- 
tence on precise terminology, let me note that the 
chapter “On Deputies or Representatives” is the only 
place in The Social Contract where Rousseau uses 
the terms “representative,” “deputy,” and “agent” 
(commissaire). By sequestering these terms, he in- 
dicates that they are not synonymous with the 
proper term for members of the government, “mag- 
istrates.” Putterman (2003) nonetheless uses the term 
“commissaires” throughout his article interchangeably 
with “magistrates,” and does the same with “deputies” 
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and also occasionally with “representatives” (e.g., 
page 459 [abstract]), although he does distinguish be- 
tween commissaires and representants at one point 
(461; see also 467). At a minimum, Putterman thereby 
introduces unnecessary confusion into his interpreta- 
tion by blurring the terms Rousseau wants to keep 
separate, like the roles they identify. 

The second passage from The Social Contract 
Putterman cites on behalf of his interpretation also 
comes from an argument about the Illegitimate usurpa- 
tion of the people’s legislative power. The passage in 
question concludes the chapter “That the General Will 
Is Indestructible”: 


I could make many comments here on the simple mght 
to vote in every act of sovereignty, a right that nothing 
can take away from the citizens, and on the right to give 
an opinion, to make propositions [proposer], to analyze, 
to discuss, which the government is always very careful to 
allow only to its members. But this important subject would 
require a separate treatise, and I cannot say everything in 
this one. (IV.1; trans. altered) 


The intended irony of this passage is manifest from 
its context. Rousseau argues in this chapter that the 
general will endures in a state even as the social bond 
weakens and private interests begin to triumph over 
the public interest. “Does it follow from this that the 
general will is annihilated or corrupted? No, it is always 
constant, unalterable, and pure. But it is subordinate to 
others that prevail over it” (SC IV.1). In this context, 
then, he argues that the people maintains its rights— 
even when the government attempts to prevent it from 
doing so. Hence the sarcastic conclusion: “The govern- 
ment is always very careful to allow only to its mem- 
bers” the power to give opinions, make propositions, 
analyze, and discuss. If the statement were limited to 
stating that the government is careful to retain the right 
“to make propositions” (proposer), then it might pos- 
sibly be read as sincere. But surely the other rights he 
lists—giving an opinion, analyzing, and deliberating— 
belong to the citizens, and not to the government 
alone. They have the right to propose laws and then 
analyze, discuss, and vote on them. This interpreta- 
tion is strengthened by the very next chapter, where 
Rousseau discusses voting and insists that the citizens 
must have “public deliberations” (IV.2). Far from indi- 
cating that the government has agenda-setting power, 
then, Rousseau’s text is a warning that the government 
will try to usurp the people’s exclusive right to make 
law. 

To be fair, the irony of this passage has eluded 
interpreters besides Putterman, including Derathé, 
who voices perplexity over Rousseau’s apparent 
limitation of the legislative right in bis editorial note 
to the passage in the Pléiade edition (Rousseau 1959- 
95, 3:1492-93, n.1-439; see also Cullen 1993, 215-16). 
Gildin is closer to a correct interpretation, I think, when 
he glosses Rousseau as saying that the government 
“seeks” to arrogate these rights to itself, and suggests 
that the author’s words “do not amount to an enthusias- 
tic endorsement of the practice they describe.” He goes 
too far, however, when he concludes that “Rousseau’s 
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remarks suggest that there are many possible arrange- 
ments regarding the right to propose and to debate 
laws compatible with his principles” (Gildin 1983, 159; 
see also Melzer 1990, 173). Bertram (2004, 173) is still 
more tentative, concluding, “Certainly it looks like an 
unsatisfactory basis for ascribing to Rousseau the po- 
sition that the government should dominate, since so 
much of the discussion of Book III has centered on the 
dangers of this very possibility.” In line with my own 
interpretation, Kelly argues that Rousseau is “warn- 
ing against this practice as a symptom of the effort of 
governments to usurp legislative power.” He further 
points out that Rousseau never places restrictions on 
the right of the people to give opinions, make propo- 
sitions, analyze, and discuss legislation, and suggests 
that the “important subject” requiring a separate trea- 
tise refers to the conditions for preserving democratic 
deliberation (Kelly 2003, 125-26). 

Finally, elsewhere ın his writings Rousseau does dis- 
cuss the power of government to propose laws in his 
native Geneva, but Putterman wisely rejects this evi- 
dence. Like the first passage analyzed above from The 
Social Contract, concerning representatives, these dis- 
cussions have the remedial purpose of at least limiting 
the illegitimate powers usurped from the citizenry. In 
the relevant passage from the Dedication to Geneva 
from the Discourse on Inequality, Rousseau (1990-) 
writes: “I would have desired that, in order to stop 
the selfish and ill-conceived projects and the dangerous 
innovations that finally ruined the Athenians, everyone 
did not have the power to propose new laws accord- 
ing to his fancy; that this right belonged exclusively to 
the magistrates” (3:5). This passage is cited by Fralin 
(1978, 54) in his analysis of Rousseau and represen- 
tation as evidence that the government properly pos- 
sesses the power to propose legislation in Rousseau’s 
state (see also Melzer 1990, 173). Putterman (2003) 
nonetheless persuasively argues that this evidence of- 
fers “soft ground” for such an interpretation given 
that Rousseau’s intentionally idealized portrait of his 
native city is an implicit critique of the illegitimate 
usurpation of the legislative power by the Petit Conseil 
(461-63). I would add that Rousseau’s (1990-) argu- 
ment here is that no one should have “the power to 
propose new laws according to his fancy [fantaisie],” 
since for him legitimate laws must have a specific form 
and formality. He is not arguing that the citizens do 
not have the right to propose legislation, for in the 
preceding paragraph he asserts that the “the right of 
legislation [droit de législation|” should be “common 
to all citizens” (3:5), and, given the formulation, this 
right presumably includes the right to propose proper 
laws. 

The potential evidence from the Letters Written from 
the Mountain (esp. Letter 9, Rousseau 1990-, 9:283-86) 
is similarly unpersuasive. Putterman does not refer to 
these passages in his analysis, but Melzer (1990) does 
in his considerably more tentative discussion of the 
possible role of the government in initiating legisla- 
tion (173, 212; see also Gildin 1983, 159). Rousseau’s 
concern in this context, however, is to wrest as much 
power as possible from the Petit Conseil, which claims 
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the sole right to legislate, and if he recognizes its 
“negative right” over the laws, restricting the body to 
this function would amount to an effective reduction 
of its legislative power. He therefore strenuously in- 
sists that the right to debate proposals belongs to the 
people and he argues that the people also has the right 
to enact laws not proposed by the magistrates (Letters 
Written from the Mountain, Letter 7, Rousseau 1990-, 
9:250). The most promising passage for the alternative 
interpretation, cited by Melzer, comes when Rousseau 
agrees with his opponent that some form of veto power 
“very reasonable,” at least in the case put forward, and 
explains that it would be impossible for a democratic 
constitution to “maintain itself if the legislative power 
could always be set into motion by each of those who 
compose it” (Letter 9, 9:285). First, in the Genevan case 
Rousseau is discussing there are two legislative bodies 
that “compose” the legislative power, and he is there- 
fore discussing their concurrence and a possible “neg- 
ative power” of one body on another. Second, as with 
the passage from The Social Contract where he objects 
to anyone being able to propose new laws “according 
to his fancy,” Rousseau is once again emphasizing that 
the initiation of legislation cannot “always be set into 
motion,” but can legitimately occur only under certain 
circumstances and in certain forms (see SC 01.13). In 
any case, Rousseau’s acknowledgment of the power to 
propose laws possessed by the Genevan government 
does not constitute a division of legislative power he 
would otherwise approve. 

In sum, then, the evidence from The Social Contract 
and elsewhere in the author’s writings does not support 
Putterman’s claim that the government in Rousseau’s 
state possesses the exclusive right to propose legisla- 
tion. In my analysis of the passages Putterman cites 
for his interpretation, I have shown that Rousseau’s 
intention is quite the contrary: he is warning the peo- 
ple that the government will try to usurp its legislative 
authority by claiming the sole right to propose and 
debate legislation or, when he seems to acknowledge 
such a right, he is trying to limit the damage from what 
he otherwise considers an illegitimate arrangement. If 
Rousseau nowhere positively argues that the govern- 
ment should possess the right to propose laws, much 
less the exclusive nght to do so, the most plausible 
alternative argument would be that, since he nowhere 
positively asserts that the government can never pos- 
sess the power of legislative initiative, the government 
may under certain conditions propose laws for the peo- 
ple’s consideration. Such an argument is plausible, and 
is similar to the interpretations offered by Gildin (1983) 
and Melzer (1990). If J am correct about the available 
evidence, however, this interpretation suffers from the 
fact that it argues from a negative; it is a dog that did 
not bark in the dark. The dog may indeed sometimes 
bark, for Rousseau does not definitively state that the 
government can never propose laws. Given that he 
explicitly states that the people alone possesses the 
legislative power, however, the more compelling in- 
terpretation is that in a legitimate state the right of 
legislative initiation belongs to the people as part of its 
legislative power. 
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ROUSSEAU AND CONTEMPORARY 
DEMOCRATIC THEORY 


From Rousseau’s perspective, as well as from the per- 
spective of his contemporary democratic followers, 
agenda-setting is a threat to popular sovereignty. I have 
argued against Putterman that Rousseau does not del- 
egate the agenda-setting power of legislative initiative 
to the government. If I have removed Rousseau from 
the frying pan of this agenda-setting power, however, I 
may have done so only to throw him into the fire of the 
paradoxical problems of voting identified by contem- 
porary social choice theory. In order to broaden the 
inquiry into the relevance of Rousseau’s thought to 
contemporary democratic theory begun by Putterman 
through his discussion of agenda-setting and major- 
ity rule, then, in conclusion I suggest how Rousseau 
anticipates and addresses the related issue of voting 
in the absence of agenda-setting institutions such as 
legislative “gatekeeping.” 

The classic contemporary critique of Rousseau that 
raises the paradoxes of voting in the absence of agenda- 
setting institutions is Riker’s Liberalism Against Pop- 
ulism (1982). Riker discusses the problems of pref- 
erence aggregation, cycling, and agenda-setting faced 
by any voting system, including Rousseau’s “populist” 
theory. Drawing on Arrow (1963), he shows that where 
there are at least three voters and three choices, differ- 
ent outcomes may result from identical values depend- 
ing on the voting rule. Given this “paradox of voting,” 
there is no actual or identifiable common good, or 
“general will.” Similarly, different orderings of pref- 
erences would lead to “cycling,” where voters cycle 
through different, but unstable majorities. Under these 
conditions there is a pervasive opportunity for manip- 
ulating the outcome of voting through agenda-setting, 
or controlling the order in which the alternatives are 
considered. If Rousseau is aware of the pervasiveness 
of agenda-setting and tries to prevent the government 
from controlling the legislative agenda, is he similarly 
conscious of the paradoxes of voting? 

There have been a number of responses to Riker’s 
argument and his critique of Rousseau, including skep- 
tical investigations of whether deliberative democracy 
can remedy the problems of aggregative democratic 
institutions identified by social choice theory (Knight 
and Johnson 1994) and arguments about the efficacy 
of deliberative democracy based in part on a critique 
of the social choice approach (Dryzek 2000). The most 
interesting attempts to clarify Rousseau’s thought with 
the findings of social choice theory in mind have cen- 
tered on interpreting the “general will” in light of 
Condorcet’s jury theorem (Grofman and Feld 1988; see 
also Barry 1965, 292-94, and Runciman and Sen 1965). 
These attempts characterize voting in Rousseau’s the- 
ory as the expression of opinions about what is in the 
common interest, or the identification of the general 
will, and therefore adopt an “epistemic” conception 
of voting (see Coleman and Ferejohn 1986). This ap- 
proach to the general will is useful, and has good textual 
warrant in Rousseau’s writings. Before the citizens give 
an opinion on the common good, however, they have 
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to will or desire it. The measures Rousseau proposes to 
increase the probability that the general will is realized 
in this sense are aimed less at the citizens’ knowledge 
than at their preferences. 

In his own solution to something like the paradox 
of voting identified by social choice theory, Rousseau 
restricts the domain of the preferences, or wills, that 
can be or are likely to be expressed by the citizens 
through voting. The first domain restriction Rousseau 
imposes is a formal one, for he argues that laws must 
be general in form: “Thus just as a private will can- 
not represent the general will, the general will in turn 
changes its nature when it has a particular object; and 
as a general will it cannot pass judgment on either a 
man or a fact” (SC I.4). The subject of the law must 
be “general like the will that enacts,’ and “there is 
no general will concerning a particular object” (11.6). 
This is precisely Rousseau’s logic for delegating the 
execution of the law to a separate body. A number of 
contemporary democratic theorists, in part following 
Rawls (1971), address the issues raised by social choice 
theory by suggesting analogous restrictions on what can 
legitimately considered in deliberation (Dryzek 2000, 
42-47; Gutmann and Thompson 1996). 

The second way in which Rousseau restricts the do- 
main is to limit the variety of preferences that are likely 
to be expressed in his state. That is, if the possibility 
of cycling occurs where there are at least three vot- 
ers and three alternatives, one could solve the prob- 
lem by reducing the number of voters or alternatives 
to two or fewer. Rousseau tries to accomplish this 
by increasing the homogeneity of the citizens, which 
would limit the variety of preferences. In principle, 
this strategy would approach a situation where there 
is a uniform preference-ordering among all citizens or, 
more realistically, one sufficiently dominant preference 
or preference-ordering. As Trachtenberg (1993, 256) 
notes, Arrow (1963) himself recognizes this possible 
solution to his paradox when he explains that “the 
possibility of social welfare judgments rests upon a 
similarity of attitudes toward social alternatives” (69). 
The question for Arrow would be whether such a strat- 
egy would violate his condition of citizens’ sovereignty, 
which dictates that the social welfare function is not to 
be imposed (28-30). 

Rousseau’s awareness of the strategy of domain re- 
striction can be seen from his treatment of different 
“wills” present in the same person or group of people. 
The most general case is the conflict he acknowledges 
between the potentially competing wills each person 
has as an individual and as a citizen: “Indeed, each 
individual can, as a man, have a private will contrary to 
or differing from the general will he has as a citizen.” 
In order to solve or at least ameliorate this conflict, 
the general will one has as a citizen must be made 
to dominate the particular will one has as an individ- 
ual. In Rousseau’s infamous formulation, the person 
whose particular will is predominant “will be forced 
to be free” (SC I.7). His point here is clear enough 
for the present purposes without entering into the 
long debate over the meaning of this shocking phrase: 
the citizens all have the same dominant preference, 
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to prefer the common good as a citizen. Rousseau’s 
basic idea can translated into the language of single- 
peaking preference curves, for as Riker and Ordeshook 
(1973) explain, “reflect a cultural uniformity about 
the standard of judgment, even though people differ 
about what ought to be chosen under that standard” 
(105). 

In order to encourage the dominance of the gen- 
eral will and also to limit the range of likely prefer- 
ences, Rousseau argues that civic-mindedness must be 
instilled through education and civic religion and com- 
mon customs, opinions, and mores (see SC IL.7, 11.12, 
IV.8). If the citizens all think of themselves as being 
the same, then their individual preferences become 
common preferences. “Why is the general will always 
right and why do all constantly want the happiness of 
each,” Rousseau explains, “if not because there is no 
one who does not apply this word each to himself, and 
does not think of himself as he votes for all?” (II.4). 
Relative equality among the citizens further reinforces 
this mutual identification (see I.9, 1.11), for “the pri- 
vate will tends by its nature toward preferences, and the 
general will toward equality” (0.1). Rousseau’s advice 
to eliminate or disperse “partial societies” has a sim- 
ilar intention, for these factions multiply preferences 
and preference-orderings (I.3; see also Grofman and 
Feld 1988). Overall, then, limiting the heterogeneity 
of preferences makes the restriction of the domain to 
generalizable preferences even more effective both as 
a solution to the paradox of voting and as a way to 
prevent agenda-setting. 

The most elaborate treatment of the issue of compet- 
ing preference-orderings comes, revealingly enough, in 
Rousseau’s discussion of a problem posed by creating 
a separate executive power: 


We can distinguish thres essentially different wills in the 
person of the magistrate. First, the individual’s own will, 
which tends only toward his particular advantage. Sec- 
ond, the common will of the magistrates, which relates 
uniquely to the advantage of the prince [i.e., government]; 
which may be called the corporate will.... Third, the will 
of the people or the sovereign will, which is general in 
relation to the State considered as a whole and in relation 
to the government considered as a part. (SC 0.2; trans. 
altered) 


“In perfect legislation,” he explains, “the particular 
or individual will should be null; the corporate will 
of the government very subordinate; and consequently 
the general or sovereign will always dominant and 
the unique rule of all the others.” In other words, 
if the individual will is rendered “null,” in principle 
there are only two preferences with a determinate 
preference-ordering. But there is a problem, for nature 
does not follow the dictates of politics. The “natural 
order” that derives from the primacy of natural self- 
love or self-interest produces a “preference-ordering” 
of these three wills that is “exactly the opposite” to 
that required by politics, with the will of the individ- 
ual predominant, the corporate will secondary, and 
the general will weakest of all (1.2). It is the inevitable 
ascendancy of this natural order, especially among 
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the magistrates, that Rousseau credits with the death 
of the body politic. He attempts to ameliorate this 
problem both through institutional means, keeping the 
executive power within its bounds and insisting that the 
people assemble often to exercise their general will, 
and through cultural means, inspiring magistrates with 
the same public-spiritedness that must reign among the 
citizens. 

Rousseau’s awareness both of agenda-setting and of 
the solution to the paradox of voting through domain 
restriction comes to light in the chapter from The Social 
Contract that ends with the second passage examined 
above in which he sarcastically warns against the gov- 
ernment’s desire to usurp the power to propose laws. 
The beginning of the chapter is more hopeful: 


As long as several men together consider themselves to be 
a single body, they have only a single will, which relates to 
their common preservation and the general welfare. Then 
all the mechanisms of the State are vigorous and simple, 
its maxims are clear and luminous, it has no tangled, con- 
tradictory interests; the common good is clearly apparent 
everywhere, and requires only good sense to be perceived. 
Peace, union, and equality are enemies of political sub- 
tleties.... (SC IV.1) 


In such a world, there is no room for subtle manipula- 
tions such as agenda-setting and voting is unproblem- 
atic. But Rousseau is not naive. Having painted his pic- 
ture of peasants under an oak tree reaching immediate 
and unanimous decisions, he warns that the govern- 
ment is all too careful to accord to themselves the right 
to give an opinion, to make propositions, to analyze, 
to discuss (IV.1). The art of political manipulation is a 
threat to the democratic society he portrays, but he also 
knows that his portrait is idealized. “If there were no 
different interests, the common interest, which would 
never encounter any obstacle, would scarcely be felt. 
Everything would run smoothly by itself and politics 
would cease to be an art” (1.3, n.). 

Everything does not run smoothly of itself, of course, 
and Rousseau is well aware of this. If part of his agenda 
is to prevent agenda-setting by the government in the 
form of legislative initiative, Rousseau’s argument that 
individual self-interest or self-love is at the core of hu- 
man nature does not allow him to entertain political 
perpetual motion. He calls for a higher political art 
that is a higher form of agenda-setting. This is the art 
of the great legislator, who must create the restric- 
tions on the domain of preferences that make self- 
legislation in the absence of agenda-setting possible 
(SC 11.7). Rousseau says that his role is “extraordi- 
nary,” and it is both extraordinary in the sense of be- 
ing necessarily outside the normal course of the law— 
otherwise such “law-making” would be illegitimate by 
Rousseau’s standard of self-rule—and extraordinary in 
its difficulty (11.7). Since the political spirit and there- 
fore political institutions tend to decay, Rousseau also 
suggests that analogous interventions into the polit- 
ical machine are necessary. “Just as the regimen of 
healthy people is not suited to the sick, one must not 
want to govern a corrupt people by the same laws that 
are suited to a good people” (IV.4). In his extended 


143 


Rousseau’s Anti Agenda-Setting Agenda 


discussion of Rome (IV.4~7), for example, he argues 
that different methods of voting were necessary for 
different purposes and in different times, and suggests 
that divisions of power between the patricians and the 
plebians became necessary, as did the institution of the 
tribunes. The erection of these institutions and their 
actions can be understood as forms of agenda-setting. 
Such agenda-setting is possible, even necessary, for the 
continued working of Rousseau’s state. However, they 
are legitimate if and only if they create the conditions 
necessary for the active and effective exercise of the 
sovereign people through voting. As with the power 
of legislative initiative he refuses to delegate to the 
executive, Rousseau is ever cognizant of the danger 
they pose to popular sovereignty. 

In conclusion, we should ask what we can learn 
from Rousseau’s approach to the problems of agenda- 
setting, especially the paradoxes of voting central to 
contemporary democratic and social choice theory. If 
the solution to the paradox of voting requires a high 
depree of cultural homogeneity dependent on a sort of 
higher-order agenda control, is this solution possible 
or desirable in the modern state? If Rousseau himself 
admits that the strict separation of powers necessary 
to keep the government within its bounds of execut- 
ing the law is inherently fragile, is democratic self- 
legislation sustainable? Let me suggest two answers. 
First, the trade-offs Rousseau makes us consider in his 
treatment of these issues underscore the fact that these 
questions have no easy answers. Attempts to take up 
Rousseau’s thought as a cudgel on behalf of partici- 
patory democracy while dismissing the paradoxes of 
voting identified by social choice theory or to dismiss 
his thought as naive democratic theorizing ignorant 
of the issue of agenda-setting alike underestimate the 
thinker. They perhaps also underestimate the difficulty 
of the theoretical and practical issues in contemporary 
democratic and social choice theory. Second, the stark, 
even intemperate, solutions Rousseau proposes can il- 
luminate the questions and problems perhaps less evi- 
dent in more familiar forms. For example, his approach 
to the paradox of voting through domain restriction 
takes less dramatic form today in analyses in politi- 
cal science of the tendency of issues and ideology to 
become arrayed along a single dimension, sometimes 
through agenda-setting by parties and politicians, Like 
agenda-setting institutions themselves, this tendency 
might be considered a happy example of how seemingly 
intractable theoretical problems of voting are resolved 
in practice, as lamentable restrictions of democratic 
deliberation, or as an issue requiring further reflec- 
tion and study. The questions of institutions and vot- 
ing Rousseau addresses in his theory are still with us 
today. 
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who execute this function and Rousseau “does not positively argue that commissaires, magis- 


W: drafts the laws in Rousseau’s ideal state? According to John T. Scott it is the people alone 


trates, or anyone else has the power to propose laws.” Challenging this view, I argue that the 
philosopher's varied statements on the subject of lawgiving can be shown to establish merely a right by 
the people to participate in agenda-setting by others. Rousseau believes that the laws must reflect the will 
of the people but not necessarily through their physical writing, drafting, or framing of constitutional 
legislation. More important than popular agenda-setting is the majority’s freedom to check those who 


do draft the laws. Amending my earlier 


that the sovereign is barred from legislating, I reveal 


how Rousseau’s notion of self-legislation as “gatekeeping,” rather than agenda-setting, is central to the 


political theory and institutions of Du contrat social. 


Rule” (Putterman 2003), I argue that the people 

ought to be considered the framers of the laws 
in Rousseau’s ideal state despite their delegating of 
agenda-setting power to experts. In “Rousseau’s Anti- 
Agenda-Setting Agenda and Contemporary Demo- 
cratic Theory,” John T. Scott (2005; preceding arti- 
cle) rejects any delegation of agenda-setting power by 
Rousseau and offers an original explanation for why 
lawmaking is invulnerable to democratic cycling or 
other problems associated with William Riker’s “vot- 
ing paradox.” In his essay, Scott writes that J am “in- 
correct that the sovereign people in Rousseau’s state 
delegate the power of legislative initiative” and he 
identifies what he considers to be a critical error in 
my reading of Du contrat social namely, my belief that 
it is the government that sets the legislative agenda in 
Rousseau’s state. He argues that Rousseau’s sovereign 
not only ratifies but drafts the constitutional laws, thus 
giving the people even greater latitude of legislative 
control. According to Scott, the philosopher “does not 
positively argue that commissaires, magistrates, or any- 
one else has the power to propose laws.” 

In my view, Scott offers the most cogent argu- 
ment for popular agenda-setting in Rousseau’s state 
to date. Over the years, this pure or unmediated view 
of Rousseauian lawmaking has been voiced by a num- 
ber of commentators (Cobban 1934, 91; Dent 1988, 
172; Ellenburg 1976, 159-60; Miller 1984, 64; Vaughan 
1915, 187) but without elaboration. Accurately, in my 
view, in the first half of his essay he shows how my 
own strong argument that “a body other than the 
sovereign must initiate the laws” is exaggerated. Care- 
ful attention to the context of Rousseau’s statements on 
agenda-setting serves to illustrate the philosopher’s be- 
lief that the people ought to be able to discuss, propose 
and oppose initiatives (Du contrat social [henceforth 
CS] [1997a] IV:i, 121~22/437-38; Lettres écrites de la 
montagne [henceforth LEM] [2001] VII, 249/827-28; 
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VII, 252-53/832-33)./ Any effort to prevent the people 
from participating in the drafting of legislation is 
illegitimate. 

This said, what is less clear is whether the people must 
write the laws themselves or whether the government 
is indeed barred from drafting legislation. It can be 
argued that a number of passages in Rousseau’s polit- 
ical writings directly and indirectly contradict Scott’s 
assertion that the philosopher “nowhere positively ar- 
gues that the government should possess the right to 
propose laws” or that “when he seems to acknowledge 
such a right, he is trying to limit the damage from 
what he otherwise considers an illegitimate arrange- 
ment.” Rather, agenda-setting by the government can 
be shown to be permissible so long as the people 
are free to discuss and oppose any and all initiatives. 
Rousseau’s solution to the danger of government ex- 
pansion is not a wholesale rejection of legislative ex- 
perts but, rather, greater participation by the people in 
the lawmaking process. In this regard, a nonexclusive 
right by the people to participate in the drafting of the 
laws is simply another constitutional protection against 
the danger posed by democracy’s need for experts. This 
protection complements the other creative targeted 
checks against government encroachment articulated 
by Rousseau in Du contrat social and elsewhere. His 
(and democracy’s) need to include the people in law- 
making ought not to be understood to categorically 
exclude “commissaires, magistrates, or anyone else.” 

At the outset, it should be noted that in a key re- 
spect Scott’s critique is closer to my own interpretation 
than might appear. This similarity explains, in part, why 
agenda-setting by the government in Rousseau’s state 
is not examined in any length or depth in my earlier 
essay and—mote significantly—why Scott’s critique is 
best directed against those commentators who identify 
Rousseauian “democratic self-legislation” as a politi- 
cal problem (Cullen 1993, 152-53; Johnson 1999, 87, 
118; Fralin 1978, 54). It is true that I argue that the 


l All of the page numbers and citations from Rousseau’s wri 
refer to 1) the English translations located in the references and 2) 
the standard five volume Gallimard edition of the Guvres Completes 
(1959, 1964, 1964, 1969, 1995). 


145 


Rousseau on the People as Legislative Gatekeepers 


philosopher “bars the majority from both framing 
and ratifying the laws”(Putterman 2003, 466), yet it 
must be emphasized that the thesis of my essay 
is the absolute superiority of the people over repre- 
sentatives with regard to all types of legislation, 
Le., constitutional and civil and criminal law. The 
essay is a study of majority rule over those who 
frame the agenda and why “legislative commussaires 
are neither the primary nor even the secondary means 
of engendering good laws”(463) in Rousseau’s state. 
I am emphatic that majority rule means “specifically, 
tule by the people in the broadest sense of the term 
as active, influential, and practically meaningful partic- 
ipation in lawmaking” (459). I write that though the 
people do not pen specific laws, they may “decide in a 
very general fashion, perhaps, ‘the topics announced 
on the agenda’ in the same manner that Rousseau 
suggests Poles be permitted to decide the agenda of 
the Diet” (466). Agenda-setting by experts is said to be 
compatible with democracy because Rousseau demon- 
strates how the danger posed to self-rule by experts 
can be successfully repelled. The laws reflect the will of 
the people but not necessarily through their physical 
drafting of constitutional legislation. 

Must the people draft the laws? There are three good 
reasons for believing this to be the case. Of these, 
Scott explores the first two: Rousseau’s emphasis on 
government as being merely an executive power and, 
critically, how the sovereign is a legislative power ca- 
pable of passing laws with the clarity and unanimity 
of simple peasants deliberating beneath an oak tree. 
Of the first, Rousseau writes that when the govern- 
ment shares in legislation the result is an illegitimate 
division of sovereignty (CS Wi, 82/395; Iiv, 90/404; 
Economie Politique [henceforth EP] 6/244; LEM, IV, 
232/808). Anytime the government passes laws it vio- 
lates the necessary separation of powers that must be 
respected for the laws to remain general. As a wider 
danger, his belief is that whatever benefits the state 
might reap from agenda-setting by government will 
be undermined ultimately by its members’ insatiable 
drive for power. “It would be a great folly,” according 
to the philosopher, “to hope that those who are mas- 
ters in fact would prefer some interest to their own” 
(EP 9/274). 

In the seventh and eighth of his Lettres écrites de 
la montagne, Rousseau offers bis most positive state- 
ments on popular agenda-setting. Speaking of the peo- 
ple (or the closest thing to it, the Conseil générale), 
he writes that “it is up to the Legislator, it is up to 
the drafter of the Laws not to leave equivocal terms 
in them. When they are there, it is up to the equity of 
the Magistrate to fix them in practice...” (LEM VII, 
277/862). More sharply, he asks “Isn’t it contrary to all 
reason that the executive body rule the public order of 
the Legislative body, that it prescribe to it the matters 
it must take cognizance of, that it forbid it the right 
of giving an opinion, and that it exercise its absolute 
power even in the acts made to hold it within limits?” 
(VU, 251/830) 

A second reason for believing that it is the people 
who set the legislative agenda is Rousseau’s assertion 
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that “as it becomes necessary to promulgate new ones 
[laws], this necessity is universally seen” (CS IVi, 
121/437). In the ideal state, guidance by an elite is 
unnecessary because “the springs of the State are vig- 
orous and simple, it maxims are clear and perspicuous, 
it has no confused and contradictory interests, the com- 
mon good is everywhere fully evident and requires only 
good sense to be perceived” (IV:1, 121/437). The people 
do not need legislative experts because the majority 
knows what it wants and has no “confused or contradic- 
tory interests.” With such a heightened degree of clarity 
each citizen would not “propose new Laws according 
to his fancy” (Discours sur l’origine et les fondemens de 
Vinegalité parmi les hommes [henceforth DJ] [1997a] 
116-17/114) but according to universally shared con- 
ceptions of right. This is possible because citizens raised 
in such a perfect polity would not look like persons 
raised in less opportune circumstances. Ideal citizens 
would differ from others—including Genevans—for 
example, whom he instructs “you are neither Romans, 
nor Spartans; you are not even Athenians. Leave aside 
these great names that do not suit you. You are Mer- 
chants, Artisans, Bourgeois... people for whom every 
liberty is only a means for acquiring without obstacle 
and for possessing in safety” (LEM IX, 292-93/881). 
Genevans’ preoccupation with modern liberty reaps 
material rewards but is ill- suited for achieving either 
the kind or the degree of freedom Rousseau’s asso- 
ciates with self-governance. 

A third reason for concluding that Rousseau’s laws 
are drafted by the people alone is that citizens raised 
under good laws would not need the supervision re- 
quired of a large majority. It is not only legislative 
expertise that is superfluous but also policing. He writes 
that, unlike the citizens of Geneva, “a restless, un- 
occupied, turbulent people always ready, for lack of 
private business, to get mixed up in that of the State, 
needs to be held within bounds” (LEM IX, 293/881). 
Even in Geneva “the public order of your general 
Council is the easiest thing in the world; let them sin- 
cerely wish to establish it for the public good, then 
everything will be free there, and everything will take 
place there more tranquilly than today” (VII, 252/832). 
This placidity originates from Genevans’ differences 
from Greeks and Romans in the measure of their 
private pursuits and occupations. One can speculate 
that subjects in Rousseau’s ideal state would not need 
“to be held within bounds” because of their educa- 
tion and good citizenship. Although, the philosopher 
bars the people from self-convoking themselves into 
an assembly because, among other reasons, he fears 
the risk of “seditious tumult” (CS DU1:xii, 111/426; 
Ii:xviii, 119/435), he does not appear to consider this 
risk so great as to bar them from drafting their own 
laws. 

Each of these arguments supports Scott’s belief that 
it is the people alone who draft the laws in Rousseau’s 
state. As I show below, though, this view is less defensi- 
ble than may appear. Rather than requiring everybody 
to draft the laws Rousseau demands only that the peo- 
ple’s right to discuss, propose, and oppose legislation 
be respected. 
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GENEVA AS A FLAWED MODEL FOR 
DU CONTRAT SOCIAL 


Scott’s belief that Rousseau does not “delegate the 
power of legislative initiative” can only be made con- 
vincingly if readers are willing to ignore key sections 
of the philosopher’s political writings that contradict 
this view. These sections, some of which I describe as 
“soft ground,” owing to uncertainty about the author’s 
motivations, Scott considers to be groundless because 
they refer to Geneva, a nonideal state. The philoso- 
pher is said to be merely “trying to limit the damage 
from what he otherwise considers an illegitimate ar- 
rangement.” Scott rightly notes that I characterize the 
Dédicace to the Second Discours and Lettres écrites 
de la montagne as “soft ground upon which to stake 
a claim that representatives are the locus for political 
authority” (Putterman 2003, 462-63), yet it is also true 
that many of the prescriptions for Geneva in the latter 
work are consistent with key aspects of the political 
theory of Du contrat social (LEM VI, 232/808; VII, 
261/842). Notably, Rousseau labels agenda-setting by 
government to be “one of the best maxims which good 
sense might dictate regarding the constitution of a 
government” (DI 117/114) and he identifies Geneva’s 
constitution as being the basis for the positive political 
program of Du contrat social expressly while still con- 
demning its systemic legislative flaws. He writes, “I took 
your Constitution, which J found to be beautiful, as the 
model for political institutions” and asks, “that primi- 
tive Contract, that essence of Sovereignty, that empire 
of the Laws, that institution of Government... isn’t this 
stroke for stoke the image of your Republic, since its 
birth up to this day?” (LEM VI, 233/809). 

In his discussion of the “right of legislation,” specif- 
ically, in the Dédicace, Rousseau writes that “I should 
have sought out a country” where “not everyone has 
the power to propose new Laws according to his 
fancy; that this right belong to the magistrates alone” 
(DI 116-17/113-14). In this letter the philosopher is 
not merely describing a status quo situation but posi- 
tively endorsing (“J’ aurois cherché un Pais”), among 
other things, legislative interference by magistrates. His 
belief is that a people ought to be “content to ratify 
the Laws and decide the most important public busi- 
ness in a Body on the recommendations of the Chiefs” 
(116-17/114). Why this is so is explained: 


However useful new Laws might be, their advantages are 
almost always less certain than their dangers are great. In 
this regard when the Citizen, when the Bourgeois has pro- 
posed his opinion he has done his duty, he ought moreover 
to have enough confidence in his Magistrate to judge him 
capable of weighing the advantage of what he proposes to 
him and inclined to approve ıt if he believes it useful to the 
public good. Thus the Laws has very wisely provided that 
the establishment and even the proposing of such innova- 
tions would not pass without the approval of the Councils, 
and that is what the negative right they lay claim to consists 
of, which, according to me, incontestably belongs to them.” 
(LEM VIL, 264-65/846-47) 


In this vein, Helena Rosenblatt (1997, 162) deems 
the descriptions of agenda-setting in the Dédicace to 
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the Second Discours and the Lettres écrites de la mon- 
tagne accurate. Despite her skepticism about the sin- 
cerity of the philosopher’s praise for the Petit conseil 
in these works, she claims that Rousseau “disapproved 
of both popular legislative initiative and popular ref- 
erendums. The role of the citizens was to approve or 
reject laws and to elect officials capable of enforcing 
these laws for them.” 

Even assuming Rousseau to be insincere about 
Geneva being the political model for Du contrat social, 
there are passages in the latter work that refute Scott’s 
claim that the philosopher “does not positively argue 
that commissaires, magistrates, or anyone else has the 
power to propose laws.” Unfortunately, the import of 
these passages is obscured by the label that he assigns 
to legislative agenda-setting as a popular right: “gate- 
keeping.” Scott writes, “I follow Putterman in using the 
term ‘agenda-setting’ to refer to the power of legislative 
initiative, although this power is more properly a form 
of ‘gatekeeping,’ which is merely one form of agenda- 
setting.” In Rousseau’s writings the two concepts of 
agenda-setting and gatekeeping are distinct. With re- 
spect to the former, it refers to the drafting, framing, or 
writing of the laws expressly. Conversely, “gatekeep- 
ing,” a term that is not Rousseau’s but still appropriate, 
refers to the people’s ability to check others who frame 
the legislative agenda on their behalf. The people need 
not write the laws by themselves but they must be able 
to veto any initiatives with which they disagree prior to 
the vote. As Scott comments, this oversight is one “form 
of agenda-setting” because the people are empowered 
to discuss, propose, and veto initiatives. This view is 
consonant with the thrust of my earlier essay—that the 
people’s ability to exercise either a direct or an indirect 
“final check” over legislation ensures that they always 
remain the locus of political power in Rousseau’s state. 
One might characterize this as a “double check.” 

The difference between agenda-setting and gate- 
keeping is evident in the seventh Lettre écrites de la 
montagne. There, Rousseau is critical of Geneva’s con- 
stitution for forbidding the people from exercising leg- 
islative oversight, not for permitting the government 
a role in lawmaking. Sovereignty is said to be vio- 
lated because the Petit conseil goes unchecked rather 
than because this lesser body participates in lawmak- 
ing. Rousseau’s complaint is against an inferior body 
“which alone proposes, which decides first, and whose 
voice alone, even in its own cause, permits its superiors 
to have one” (LEM VI, 252/832). The Petit conseil 
“alone has its own activity, that gives theirs to all the 
others, that in all of them, supporting the resolutions 
it has taken, gives an opinion two times and votes 
three times” (VI, 253/832-33). His view is that it is 
unreasonable “to make it so that none can propose 
anything” except “those who have the greatest interest 
in harming it” because “if the existence of the gen- 
eral Council depended completely on the small Coun- 
cil, the general Council would be suppressed forever” 
(VII, 249/828). In the end, the “sovereign power is 
enchained” if the people “cannot act except when it 
pleases your Magistrates, nor speak except when they 
interrogate” it (VI, 238/814). If the sovereign cannot 
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act except when permitted by the government, then the 
laws make it “speak as it pleases” since “these Tribunals 
will certainly not approve propositions that would be 
prejudicial to them” (VIL, 249/827-28). It is apparent 
that everything is “dead in the Republic” (VII, 252/832) 
when the sovereign cannot “speak as it pleases,” but 
this is not the same thing as saying that it is only alive 
when it speaks unassisted. 

Rousseau’s efforts to ensure that the sovereign 
“speaks as it pleases” inform his definition of sepa- 
ration of powers. His barring of any and all forms of 
legislation by the government (CS Ii, 82/395; Hiiy, 
90/404; EP 6/244; LEM IV, 232/808) can be shown to 
refer not to agenda-setting but to impositions placed on 
lawmaking that the people cannot overturn. Whenever 
the majority is prevented from proposing, opposing, or 
deliberating on a law, then it is the government who is 
sovereign. What divides sovereignty is not the presence 
of foreigners but the presence of a will that is foreign 
and binding. By itself, the sharing of legislative power 
does not constitute a separation of powers as long as 
the framers of the laws remain, as I argued earlier, 
“entirely and permanently subject” (Putterman 2003, 
464) to the popular will. It is this same logic that applies 
to Rousseau’s view of the “commands of the chiefs” 
in Du contrat social. Unchecked, government’s com- 
mands or decrees are wholly illegitimate, yet they can 
still “be taken for general wills so long as the sovereign 
is free to oppose them and does not do so” (CS I:I, 
57-58), according to Rousseau. It is this express free- 
dom to oppose the chiefs—not the issuing of commands 
personally—that makes rule by government possible. 

This interpretation also finds precedence in 
Rousseau’s description of the législateur in Du con- 
trat social as a body that “drafts the laws” and yet 
has no “legislative mght” in the form of a final check 
(CS Il:vii, 70/383). This function, drafting the laws, 
appears to devolve to the Prince (or government) in 
Rousseau’s comparison between the “inventor” of the 
laws (the législateur) and the “workman” (the Prince) 
who replaces him. He writes that the Prince “need only 
follow the model which the other must propose. He 
is the mechanic who invents the machine, the first is 
nothing but the workman who assembles and operates 
it” (I:vii, 69/381). In this line the philosopher does 
not describe the Prince as merely operating the sys- 
tem of legislation created by the législateur but says he 
“assembles” or builds it also (“l’ouvrier qui la monte 
et la fait marcher”). Considering that laws are what 
the législateur “assembles,” it is not entirely unrea- 
sonable to assume that Rousseau is referring to the 
assembling of legislation, i.e., that the Prince inherits 
the législateur’s power of agenda-setting. As I explain 
in my essay, though (drawing from Emile I, 39/248; IV, 
219/500), the chief difference between the respective 
functions of the législateur and the Prince is that the lat- 
ter does not “remake individuals” but merely strives to 
“extend and strengthen” citizens’ natural dispositions 
to give their passions “order and regularity” (Putter- 
man 2003, 467). As I note, Fralin (1978) and others 
exaggerate the degree of interference that any simi- 
larity between the two institutions might suggest with 
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regard to the writing of civil and criminal law to the 
point of arguing that “the government ought to exert a 
preponderant role in legislation.” What Rousseau ac- 
tually believes is that government’s role in lJawmaking 
must be adequately checked but that this restriction 
need not bar any role in agenda-setting. 


AGENDA-SETTING IN THE IDEAL 
AND NONIDEAL STATE 


The most convincing argument for popular agenda- 
setting in Rousseau’s state is, in my view, the notion that 
experts are superfluous because the volonté générale is 
generally known. What Rousseau says is that when 
this happens, “all of the springs of the State are vig- 
orous and simple, its maxims are clear and perspic- 
uous [claire et lumineuses], it has no confused and 
contradictory interests [and] the common good is ev- 
erywhere fully evident” (CS IVi, 121/437). With such 
clarity each citizen would not “propose new Laws ac- 
cording to his fancy” (DI 116-17/114), thus making 
interference by experts an unn risk. Proposals 
would not be fanciful at all, but “what all have already 
sensed” (CS IVi, 121/437). As Scott (2004) remarks, 
this shared conception of right requires only a “specific 
form and formality” to be realizable. When this occurs 
agenda-setting and ratification are near-seamless acts 
by the people, as they express the volonté générale in 
unanimity. 

The central problem with this interpretation is that 
Rousseau characterizes lawmaking in his ideal state 
as achieving this heightened degree of transparency, 
yet he imposes restrictions on its exercise. Voting is 
free and transparent but what takes place before and 
afterward is decidedly less so. Owing to this differ- 
ence, the philosopher’s comparison between citizens 
and peasants in Book VI:i of Du contrat social can be 
construed as simply an ideal that the laws ought to 
embody rather than how this ideal is arrived at in a 
perfect state. Notably, in this passage Rousseau writes 
that “as it becomes necessary to promulgate new [laws], 
this necessity is universally seen“ (CS IV:i, 121/437), 
yet nowhere in the political program of Du contrat 
social does he allow the people to freely and regularly 
convene themselves when this need becomes known. 
All may recognize the need to pass new laws but the 
only body authorized to act on this need on a regular 
basis is the government (A:xiii, 111/426). This is true 
not only for citizens in states “badly constituted from 
their origin” (IV:i, 121/437) but also for citizens in his 
ideal state. 

Rousseau’s justification for this rule is that the de- 
cision to convene must be prescribed by law because 
it is a particular act. This said, this stricture also ap- 
pears to derive from the philosopher’s anxiety over 
the ascent of a “proud and restless” citizenry who, 
like Rome’s unruly Comutia, pursues rash or imprudent 
change. Although his idealized citizens are of a wholly 
different temperament than the tumultuous or sedi- 
tious Romans that he (and Montesquieu [1734] 1965, 
93) alternatively praises and condemns, he considers 
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this a fragile disposition. He appears to fear the emer- 
gence of an inspired citizenry that eagerly “flies to 
the assemblies” (CS Il:xv, 113/429), ever becoming 
like the ancient Romans, sovereign members becoming 
“proud and restless” as the laws weaken over time. Al- 
though Rousseau claims that only large states “require 
supervision [police|” and praises Athens’ “assembly 
of several thousands of quick-tempered, ardent, and 
almost unrestrained Citizens” for needing minimal su- 
pervision, he also says that assemblies that number in 
the thousands or tens of thousands, such as Rome’s 
Comitia, require restraint (CS IV:iv, 132/449; LEM 
VU, 251/830-31; IX, 293/881). Like the lawgivers of 
old, his solution to this problem appears to be a “spe- 
cific form and formality” for lawmaking that includes, 
among other things, a role for government in the early 
stages of the legislative process. Similar to the people’s 
ability to oppose or veto legislative initiatives, though, 
the philosopher allows the people to self-convene if and 
when the government fails to exercise its duties in good 
faith. 

Similarly, Rousseau’s remarks about the censor re- 
veal a role for government in the early stages of the leg- 
islative process. In Du contrat social, the censor is said 
to be an integral part of the state owing to citizens’ need 
for moral rectitude. Unlike the foundational législateur, 
though, the censor is not a transitory body that de- 
parts with it but, rather, remains in existence to guide 
opinion as “legislation weakens” (CS IV:vii, 141/458), 
as the state needs to be “constantly vigilant to main- 
tain or restore love of country and good mores among 
the people” (EP 126/262). A people’s “opinions arise 
from its constitution” according to Rousseau, and “the 
Censorship maintains morals by preventing opinions 
from becoming corrupt, by preserving their upright- 
ness through wise applications” of the laws (CS IV:vii, 
141/459). For these “wise applications” to be effective, 
though, it is necessary that the people be kept unaware 
of the actual intent behind certain laws in order to 
maintain their desired effect on opinion. Popular igno- 
rance makes it possible for the censor to enunciate the 
status of opinion on a particular question through refer- 
ence to a law that, on its face, appears wholly unrelated. 
This is why, according to the Genevan, laws designed to 
influence “morals, customs, and above all opinion” are 
that “which the great Lawgiver attends in secret, while 
he appears to restrict himself to particular regulations 
which are but the ribs of the arch of which morals, 
slower to arise, in the end form the immovable Key- 
stone” (U:xii, 81/394). This fourth “kind of law” that 
Rousseau assigns to the censor to administer wisely 
(IV:vii, 141/459) works best when the true object of 
each and every regulation is not fully known. All ought 
to know the intent behind most laws but not every law. 
Notably, in Book I:xii of Du contrat social, Rousseau 
refers to constitutional legislation specifically, rather 
than to the commands or decrees that would fall under 
government’s jurisdiction even before the législateur 
departs. All of these (constitutional law, commands, 
and decrees) are the object of censorship. 

Assuming that it were the case that the right of ini- 
tiation belongs to the people alone, then this aspect 
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of Rousseau’s theory is an enigma.’ Drafting all of 
the laws on their own, the people would be aware 
of their original intent and—all secrets known—the 
censor would be left having to search for hidden appli- 
cations of the law where there were none. Conversely, 
as the censor declared the status of prevailing opinion 
on any issue of controversy or recently passed law, the 
people would be able to challenge its judgment as a 
matter of right. Not only its discretion but the very basis 
of this discretion would be in dispute (its chief source 
of political power). In his Lettres écrites de la montagne, 
Rousseau asserts that when there is a controversy over 
interpretations of the law, then, at least in Geneva, the 
arbiter ought to be the people (the Conseil général). It is 
“up to the equity of the Magistrate to fix their meaning 
in practice,” but “this right does not at all go to the 
point of changing the literal sense of the laws.” When 
there is a danger of such obfuscation, the “good sense 
that pronounces is then found in the general Council” 
(LEM VU, 277/861). Yet such similar-type good sense 
in his ideal state would prove to be highly problematic 
for laws intended to preserve good moeurs. The censor 
would have to enunciate the status of opinion on a 
particular question without reference to any law or 
only to those laws that are drafted by all. Yet a solution 
to this conundrum exists if it is assumed that the people 
do not legislate unassisted. What the people enjoy, as 
Rousseau highlights in the Lettres, is a nonexclusive 
right to participate in lawmaking. They may oppose, 
discuss, and initiate legislation but it is assumed that 
not every law will be decided by the people alone. 
But there are other enigmas too. If Scott is indeed 
correct that Rousseau “nowhere positively asserts that 
the government should possess the right to propose 
laws,” itis even more true that the philosopher nowhere 
positively assigns this power to the people alone. Pop- 
ular agenda-setting must be inferred from Rousseau’s 
many statements that sovereignty not be divided 
(CS IMi, 82/395; Miv, 90/404; EP 6/244; LEM IV, 232/ 
808) because it is never argued for expressly. On the 
topic of agenda-setting, specifically, in his political writ- 
ings, the Genevan asks only that the people’s right to 
propose, discuss, and oppose legislation be respected. 
Readers are denied any opportunity to parse out his 
language on the subject because the rudiments or me- 
chanics of popular agenda-setting in his ideal state are 
left unexplained in both versions of Du contrat social 


2 This may be one of many enigmas. Scott offers an onginal so- 
lution to the problem posed to self-legislation created by Riker’s 
voting paradox. It would be interesting to examine whether other 
difficulties exist as well. One might speculate that agenda-setting 
by the people could be problematic because, like a self-convoking 
assombly, it might result m the people “pronouncing judgment on a 
particular fact” (CS, Mav, 62/374). Unlike a straight up-or-down 
vote on ratifying or rejecting various initiatives already known to 
be general, the people would have to pass judgments upon the par- 
ticulars or content of the laws Of course, agenda-setting does not 
represent a formal expression of the volonté générale by the people 
but it still may be problematic if lawgiving (writing legislation) 1s con- 
sidered to be in any way a sovereign act. Anyhow, beyond this (likely 
idle) speculation it would be interesting to consider other possible 
problems to agenda-setting by the people other than Riker’s voting 
paradox More interesting would be Rousseau’s possible solutions. 
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Rousseau on the People as Legislative Gatekeepers 


For a philosopher who harbored fears about his polit- 
ical program being relegated to the “land of chimeras” 
(LEM IX, 234/810), this silence is striking. Though 
Rousseau might have anticipated the skepticism of crit- 
ics, apparently he never sought to explain how popular 
agenda-setting might work or give substance to a politi- 
cal proposal that, arguably, is the most chimerical of all. 
This silence appears especially curious in light of the 
expansive discussion of voting in Du contrat social. Pre- 
sumably, agenda-setting and voting would prove to be 
identical functions for a people without any “confused 
or contradictory interests” who are deliberating over 
“clear and perspicuous” maxims (CS [V:i, 121/437). For 
a philosopher writing for the children of Adam it was 
incumbent upon Rousseau to explain the relationship 
between the two. Yet he doesn’t. To locate an answer 
one must turn to the only discussions anywhere in his 
political writings where the subject is broached at all: 
the Dédicace to the Second Discours and the Lettres 
écrites de la montagne. And it is in these works that 
Rousseau stresses the need for legislative experts. 

Finally, assuming that Scott is indeed correct that 
the right of initiation belongs to the people alone in 
Rousseau’s ideal state, there are solid reasons for con- 
cluding that this is not to be the case for lawgiving in 
Poland, Corsica, Geneva, or any of his other nonideal 
states. On this point Scott appears to agree: it may 
be that the Genevan believes a virtuous citizenry living 
under a well-ordered constitution ought to draft its own 
laws, but it is not his view that this power should be 
perpetuated after morals decline. The philosopher ex- 
presses this viewpoint in a number of writings in a 
manner consistent with Scott’s description of popular 
agenda-setting as adhering to a “specific form or for- 
mality.” His belief is that once the people are no longer 
able to recognize the “clear and perspicuous” volonté 
générale or have become too corrupt to give this will 
concrete expression, they ought not to continue behav- 
ing as if they were village peasants beneath an oak tree. 
As “the common interest diminishes” and the volonté 
générale is no longer “universally seen” (CS IVi, 121/ 
438), each citizen will “propose new Laws accord- 
ing to his fancy” and these proposals will “almost al- 
ways” be “less certain than their dangers are great” 
(DI 116-17/114). It is because of these dangers that 
Rousseau suggests that within Geneva, the model non- 
ideal state, supposedly, “the Laws have very wisely pro- 
vided that... the proposing of such innovations would 
not pass without the approval of the Councils” (LEM 
VIO, 264-65/846—47). 

In this respect, Rousseau identifies two kinds of 
people who are unfit for unmediated lawmaking in 
nonideal states: “nascent people” with “neither deep- 
rooted customs nor deep-rooted superstitions,” who 
remain capable of “appreciating sound maxims and of 
following the fundamental rules of reason of State” 
(CS Ox, 77/390, U:vii, 69/383), and “corrupt peo- 
ple” who, at their worst, are “competitors, rivals” and 
“enemies” and desirous of wealth and security over 
liberty (LEM IX, 292-93/881). His belief is that the 
latter kind have not known liberty or, if so, have lost 
it owing to moral decay (CS IV:i, 121/438). Their po- 
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litical prospects can be summed up in one of three 
ways: precarious self-rule, partial rule by others, or total 
domination. The first two types of rule form a Hobson’s 
choice that Rousseau chooses between in place of dom- 
ination in the Dédicace to the Second Discours and 
the Lettres écrites de la montagne. His view is that in 
a poorly constituted state it is best to avoid fanciful 
or dangerous initiatives and seek wise counsel (“the 
recommendations of the Chiefs”) and that this delega- 
tion of legislative power to experts does not undermine 
the liberty of those who are incognizant of the volonté 
générale and know liberty only in an “illusory and vain 
form” (CS IV:i, 122/438). Rather, it allows those who 
are the least capable of self-governance from having 
to draft laws by themselves and, as Rousseau explains, 
blundering fatally. This is why “the Laws have very 
wisely provided that the establishment and even the 
proposing of such innovations would not pass without 
the approval of the Councils” (LEM VII, 265/847). 
Ideally, agenda-setting by experts is a temporary and 
partial bulwark against total domination if it is com- 
plemented by the people’s right to discuss, propose, 
and oppose initiatives. As risky as this may be (if the 
check is uncertain), it is better than the prospect of 
“merchants, artisans, and bourgeoisie” behaving as for- 
est peasants—for political liberty will be even more 
short-lived for a people who put security and private 
interest above the common good. This said, in the long 
term either choice will be fatal if the people are too 
corrupt to adequately check those whom they appoint 
as framers because persons who are “completely ab- 
sorbed in their domestic occupations” will not be care- 
ful “about casting light on the conduct of their lead- 
ers” to recognize “the chains that are being prepared 
for them” until “they feel their weight.” Unlike ideal 
citizens, who vigilantly guard their liberty through ac- 
tive participation in lawmaking, such types will always 
be “occupied with their private interests, with their 
work, with their trafficking, with their gain” and, in 
the end, wind up “always distracted, always deceived” 
(LEM IX, 292-93/881). 


AGENDA-SETTING IN DU CONTRAT SOCIAL 


For Giovanni Sartori (1987) “a government of experts” 
(433) is admissible in a democracy “as long as what is 
essential—and therefore must be controlled—is kept 
within the area of democratic controls” (432). In Du 
contrat social, Rousseau keeps what is essential within 
the area of democratic controls and, in so doing, makes 
possible a level of popular participation in govern- 
ment that Sartori and other elitists consider impossible. 
The coercive and noncoercive power of these controls 
exercised prior to the vote, the subject of this essay, 
and that exercised after, the subject of my first, reveal 
the philosopher to be no less a democrat if a body 
other than the people drafts the laws. Through tightly 
coupled institutional checks Rousseau serves to ensure 
that the people continue to exercise sovereignty even 
if they do not write the laws alone. The only thing that 
they must do alone is vote. 
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The notion that only the highest or most remote stan- 
dard for democracy—popular agenda-setting—must 
be satisfied for political sovereignty to remain undi- 
vided is at odds with the philosopher’s desire for re- 
alism and his declared emphasis on voting as an ex- 
pression of sovereign rule in Du contrat social. More 
importantly, though, it is superfluous. As long as the 
people retain a nght of initiation that allows them to 
overturn any and all legislative proposals with which 
they disagree, it is not necessary that they draft all of 
the laws personally. With the right of ratification, this 
right to propose, discuss, and veto legislation written by 
others can be regarded as a second check. Even more 
than the idea that it is the people who must draft the 
laws, this interpretation is congruous with the theory 
and institutions of Du contrat social. 
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Notes from the Editor 


ANNOUNCEMENT 


The inaugural issue of the American Political Science Review was published in November of 1906. 
To mark the centennial of the APSR, the November 2006 issue will feature a special section devoted 
to considerations of the evolution of political science. This special section will be an extra feature, 
above and beyond the regular complement of research articles. 

The APSR is actively soliciting submissions on the broad theme of the special section. 

If you are interested in submitting a paper, please contact the editor of the APSR at apsr@gwu.edu 
to express your interest and to provide a brief description of the paper you would like to submit. Also, 
please bring this solicitation to the attention of others who may be interested and encourage them to 
contact the editor. 

Like all other papers submitted to the APSR, submissions received in response to this solicitation 
will undergo peer review. Overseeing this process will be the editor of the APSR, Lee Sigelman, and 
a member of the APSR’s editorial board, M. Elizabeth Sanders of Cornell University. 

To be considered for publication, a paper must be no more than 15-17 pages in length, conven- 
tionally formatted (e.g., double-spaced throughout, including notes and references, with margins of 
at least one inch on all sides and set up in at least an 11-point font size). 

In light of the length constraints, authors are advised to address a specific theme rather than 
aiming at a broader, synoptic disciplinary overview. Pertinent examples would include: an analysis 
of a particular turning point in the evolution of the discipline or of the role played by a particular 
individual; a treatment of the evolution of some influential school of thought; a comparison of the 
evolution of two subfields of political science, or of political science in the U.S. and another area, Or 
of political science and another discipline; or a consideration of how a certain type of scholarship has 
influenced and/or been influenced by public policy, or the relationship between political science and 
the state. (These are offered only as examples, rather than as a definitive set of topics to be addressed.) 

To be considered for publication, submissions must reach the APSR office by no later than January 


2, 2006, but earlier submission is encouraged. 


In this Issue! 


The human genome is a biological magnum opus three 
billion letters long, a tiny portion of which adorns the 
cover of this issue. As an information storage device, 
DNA dwarfs human-made libraries and microchips. 
This remarkable code contains the instructions for 
producing living things from aardvarks to zebras and 
determines traits from sex to allergen sensitivity. Yet 
DNA does not determine many aspects of our lives, 
from the career paths we choose to the types of music 
we prefer. Thus assessments of the relative and joint 
effects of “nature” and “nurture” go on in a vast array 
of contexts, presenting new and ever-changing answers 
to the questions of who we are and why we are what 
we are—biologically and even, as the lead article in 
this issue suggests, politically. The genetic code on our 
cover, which is tinted green in honor of the month of 
May, could well have been shaded in the reds and blues 
of a map of presidential election results in honor of that 
article. 

One often hears about long-separated twins who, 
upon meeting, discover that both like the same foods 
and drive the same make of car. But how about twins 
who vote for the same party and care about the same 
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issues? Genetics meets political science in “Are Politi- 
cal Orientations Genetically Transmitted?” by John R. 
Alford, Carolyn L. Funk and John R. Hibbing. Drawing 
on databases compiled in Virginia and Australia, 
Hibbing, Alford, and Funk conclude that biology 
shapes ideological outlooks, even more than parental 
socialization does. Here, then, is intriguing evidence 
that genetic predispositions play a more prominent role 
than political scientists have recognized. 

If our genes predispose us to take different posi- 
tions on political issues, then electoral calculations may 
work rather differently than previously thought. Scott 
Basinger and Howard Lavine’s “Ambivalence, Infor- 
mation and Electoral Choice” contributes to a rapidly 
developing subfield of electoral research that ques- 
tions conventional wisdom about the determinants of 
vote choice. Voters use various cues to evaluate polit- 
ical candidates—party, ideology, economics, and spe- 
cific issues, among others. Do they weight these fac- 
tors equally? Basinger and Lavine suggest that they 
do not. By creatively synthesizing insights from re- 
search on information processing, congressional elec- 
tions, and public opinion, Basinger and Lavine develop 
an ambivalence-centered model that depicts those who 
hold ambivalent partisan attitudes as distinctive in 
the cues they use to select their favorite candidate. 


Notes from the Editor 


Identifying these cues and examining how voters use 
them to home in on their preferred candidates are 
Basinger and Lavine’s primary contributions in an 
analysis that advances our understanding of electoral 
decisionmaking. 

Voters must not only consider the relative salience 
of different cues, but also devise voting strategies that 
make sense given the institutional context in which 
they find themselves. Of course, the ideas that voters 
act strategically and are influenced by policy outcomes 
are well established, but Orit Kedar (“When Moderate 
Voters Prefer Extreme Parties: Policy Balancing in 
Parliamentary Elections”) imparts some new twists to 
these ideas by probing the interplay of institutions and 
individual behavior. Using survey data from Norway, 
the Netherlands, Britain, and Canada, Kedar argues 
that voters in proportional representation systems of- 
ten engage in “compensatory voting” and favor more 
extreme parties where there is considerable power- 
sharing among parties; by contrast, plurality-based sys- 
tems are likely to motivate more ideologically sincere 
voting patterns. Kedar’s analysis should interest a wide 
readership ranging from American scholars who study 
strategic voting and electoral systems to comparativists 
who are concerned with “bringing the state back in.” 

Whether conducted in a New England town hall or 
a European-style proportional representation system, 
voting is the quintessential collective action activity—a 
modestly time-consuming task for most citizens, but 
not usually a dangerous one. Engaging in other forms 
of collective action, however, can occasion greater risk, 
from bearing the financial costs of an endeavor alone 
to being punished or even executed for challenging the 
status quo. Thus, in politics, fools may rush in where 
angels fear to tread only when they have company. 
Jacob K. Goeree and Charles A. Holt apply the con- 
cept of quantal response equilibrium to several im- 
portant categories of political activity in “An Expla- 
nation of Anomalous Behavior in Models of Political 
Participation.” Their insights help account for failures 
of equilibrium-based models to explain what actually 
happens in real-world situations. For insights about sit- 
uations ranging from the dynamics of anti-government 
demonstrations in Lebanon to the jockeying that goes 
on when incumbent senators announce their retire- 
ment, Goeree and Holt’s analysis merits attention from 
a wide range of readers, including those who might 
ordinarily shy away from an article based on formal 
modeling. 

Another ingredient in the collective action recipe is 
social capital. The creation of social capital through 
civic engagement is often cited as an important factor 
in “making democracy work,” but disagreement per- 
sists what form these civic engagements should take. 
In “Civic Engagement and Mass-Elite Policy Agenda 
Agreement in American Communities,” Kim Quaile 
Hill and Tetsuya Matsubayashi test whether bridging 
or bonding associations produce greater leader respon- 
siveness. In addition to the novel findings they report 
of no connection between bridging associations and 
leader responsiveness and of a negative association 
between bonding associations and such responsiveness, 
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Hill and Matsubayashi introduce innovative measures 
of policy concurrence and member participation. ‘This 
article seems likely to excite both Americanists in the 
behavioral tradition and comparativists interested in 
democratization. 

In the United States, policy outcomes are of course 
shaped by the provisions of the Constitution. Genera- 
tions of American schoolchildren have been taught that 
the Father Of The Constitution was James Madison, 
who dominated the Constitutional Convention with 
his Virginia Plan and then pushed for ratification in 
the Federalist Papers. David B. Robertson challenges 
the textbook version of history in “Madison’s Oppo- 
nents and Constitutional Design.” Robertson argues 
that Madison’s Virginia plan was successfully chal- 
lenged on many fronts by a coherent and influential op- 
position led by Connecticut delegate Roger Sherman. 
Many of the prominent constitutional features we see 
today were the result of path-dependent compromises 
balancing state and national power. Madison lost im- 
portant fights on issues such as federalism and states’ 
rights, intra- and interstate commerce, separation of 
powers, and the composition, powers, and selection of 
the national legislature and executive. Madison may 
have been the Constitution’s father, but he lost many 
a custody battle in its drafting stages. For an audi- 
ence ranging from senior scholars down through be- 
ginning undergraduates, Robertson’s analysis teaches 
some valuable lessons about American politics and the 
Constitution’s origins. 

A major component of American government not 
detailed in the Constitution is the bureaucracy—an 
administrative amalgam of executive, legislative, and 
even judicial functions and features that handles most 
citizen-government interactions. Just as the Founders 
were concerned about the ethics of elected officials, 
Sanford C. Gordon and Catherine Hafer (“Flexing 
Muscle: Corporate Political Expenditures as Signals 
to the Bureaucracy”) wonder whether private firms 
can buy favorable bureaucratic decisions. Gordon and 
Hafer focus on the influence of regulated industries, 
especially nuclear power companies, on enforcement 
decisions of agencies that are charged with regulating 
them. At the heart of corporate influence, Gordon and 
Hafer argue, are the political contributions that firms 
use to signal their willingness and ability to challenge 
unfavorable agency actions. In contrast to previous re- 
search, Gordon and Hafer see the motivation underly- 
ing political contributions as one of influencing the de- 
cisions of regulatory agencies rather than congressional 
enactments. The provocative findings of this analysis 
seem certain to add fuel to the already heated debate 
over the extent to which and the manner in which po- 
litical contributions influence policy outcomes. 

Like Gordon and Hafer, who obtain thought-pro- 
voking data from corporate lobbies, Michael Laver 
(“Policy and the Dynamics of Political Competition”) 
argues that political scientists can improve their theory 
by studying the “real world” of politics. Laver insists 
that political scientists must start treating parties as 
partisans do: as maelstroms of activity and change, 
not inert institutional fossils, especially when it comes 
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to modeling multi-party competition. Political parties 
have access to much more information (such as opin- 
ion polls) than is generally recognized, and it is up to 
political scientists to begin to represent these policy 
environments more accurately in their models of pol- 
itics. Laver develops different algorithms to allow for 
party adaptation in multi-party systems and applies his 
findings to the Irish party system in this insightful study. 

In the September 2000 issue, the APSR published 
“The Effects of Canvassing, Telephone Calls, and Di- 
rect Mail on Voter Turnout: A Field Experiment,” 
by Alan S. Gerber and Donald P. Green. Gerber 
and Green contended that face-to-face visits during 
the election campaigns they studied increased voter 
turnout substantially, that direct mail did so modestly, 
and that brief telephone calls were ineffective. In the 
“Forum” section of this issue, Kosuke Imai (“Do Get- 
Out-The- Vote Calls Reduce Turnout? The Importance 
of Statistical Methods for Field Experiments”) chal- 
lenges the data, methodology, and substantive results 
of the Gerber-Green study. In response (“Correction 
to Gerber and Green, Replication of Disputed Find- 
ings, and Reply to Imai”), Gerber and Green concede 
and correct some data problems but vigorously dispute 
Imai’s challenges to their methodology and substan- 
tive results. This exchange highlights the relationship 
between methods and results, and demonstrates how 
spirited debate can contribute to our understanding of 
political phenomena. 


INSTRUCTIONS TO CONTRIBUTORS 


General Considerations 


The APSR strives to publish scholarly research of 
exceptional merit, focusing on important issues and 
demonstrating the highest standards of excellence 
in conceptualization, exposition, methodology, and 
craftsmanship. Because the APSR reaches a diverse 
audience of scholars and practitioners, authors must 
demonstrate how their analysis illuminates a significant 
research problem, or answers an important research 
question, of general interest in political science. For the 
same reason, authors must strive for a presentation that 
will be understandable to as many scholars as possible, 
consistent with the nature of their material. 

The APSR publishes original work. Therefore, au- 
thors should not submit articles containing tables, 
figures, or substantial amounts of text that have al- 
ready been published or are forthcoming in other 
places, or that have been included in other manuscripts 
submitted for review to book publishers or periodicals 
(including on-line journals). In many such cases, sub- 
sequent publication of this material would violate the 
copyright of the other publisher. The APSR also does 
not consider papers that are currently under review 
by other journals or duplicate or overlap with parts of 
larger manuscripts that have been submitted to other 
publishers (including publishers of both books and 
periodicals), Submission of manuscripts substantially 
similar to those submitted or published elsewhere, or 


as part of a book or other larger work, is also strongly 
discouraged. If you have any questions about whether 
these policies apply in your particular case, you should 
discuss any such publications related to a submission in 
a cover letter to the Editor. You should also notify the 
Editor of any related submissions to other publishers, 
whether for book or periodical publication, that occur 
while a manuscript is under review by the APSR and 
which would fall within the scope of this policy. The 
Editor may request copies of related publications. 

If your manuscript contains quantitative evidence 
and analysis, you should describe your procedures 
in sufficient detail to permit reviewers to understand 
and evaluate what has been done and, in the event 
that the article is accepted for publication, to per- 
mit other scholars to carry out similar analyses on 
other data sets. For example, for surveys, at the least, 
sampling procedures, response rates, and question 
wordings should be given; you should calculate re- 
sponse rates according to one of the standard formulas 
given by the American Association for Public Opinion 
Research, Standard Definitions: Final Dispositions of 
Case Codes and Outcome Rates for Surveys (Ann 
Arbor, MI: AAPOR, 2000). This document is available 
on the Internet at <http:/Awww.aapor.org/default.asp? 
page = survey_methods/standards_and_best_practices/ 
standard _definitions>. For experiments, provide full 
descriptions of experimental protocols, methods of 
subject recruitment and selection, subject payments 
and debriefing procedures, and so on. Articles should 
be self-contained, so you should not simply refer read- 
ers to other publications for descriptions of these basic 
research procedures. 

Please indicate variables included in statistical anal- 
yses by capitalizing the first letter in the variable 
name and italicizing the entire variable name the first 
time each is mentioned in the text. You should also use 
the same names for variables in text and tables and, 
wherever possible, should avoid the use of acronyms 
and computer abbreviations when discussing variables 
in the text. All variables appearing in tables should 
have been mentioned in the text and the reason for 
their inclusion discussed. 

As part of the review process, you may be asked 
to submit additional documentation if procedures are 
not sufficiently clear; the review process works most 
efficiently if such information is given in the initial 
submission. If you advise readers that additional infor- 
mation is available, you should submit printed copies 
of that information with the manuscript. If the amount 
of this supplementary information is extensive, please 
inquire about alternate procedures. 

The APSR uses a double-blind review process. You 
should follow the guidelines for preparing anonymous 
copies in the Specific Procedures section below. 

Manuscripts that are largely or entirely critiques or 
commentaries on previously published APSR articles 
will be reviewed using the same general procedures as 
for other manuscripts, with one exception. In addition 
to the usual number of reviewers, such manuscripts will 
also be sent to the scholar(s) whose work is being crit- 
icized, in the same anonymous form that they are sent 
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to reviewers. Comments from the original author(s) to 
the Editor will be invited as a supplement to the advice 
of reviewers. This notice to the original author(s) is 
intended (1) to encourage review of the details of 
analyses or research procedures that might escape 
the notice of disinterested reviewers; (2) to enable 
prompt publication of critiques by supplying criticized 
authors with early notice of their existence and, there- 
fore, more adequate time to reply; and (3) as a courtesy 
to criticized authors. If you submit such a manuscript, 
you should therefore send as many additional copies of 
their manuscripts as will be required for this purpose. 
Manuscripts being submitted for publication should 
be sent to Lee Sigelman, Editor, American Politi- 
cal Science Review, Department of Political Science, 
The George Washington University, Washington, DC 
20052. Correspondence concerning manuscripts under 
review may be sent to the same address or e-mailed to 


apsr@gwu.edu. 


Manuscript Formatting 


Manuscripts should not be longer than 45 pages in- 
cluding text, all tables and figures, notes, references, 
and appendices. This page size guideline is based on the 
U.S. standard 8.5 x 11-inch paper; if you are submitting 
a manuscript printed on longer paper, you must adjust 
accordingly. The font size must be at least 11 points for 
all parts of the paper, including notes and references. 
The entire paper, including notes and references, must 
be double-spaced, with the sole exception of tables 
for which double-spacing would require a second page 
otherwise not needed. All pages should be numbered in 
one sequence, and text should be formatted using a nor- 
mal single column no wider than 6.5 inches, as is typical 
for manuscripts (rather than the double-column format 
of the published version of the APSR), and printed on 
one side of the page only. Include an abstract of no 
more than 150 words. The APSR style of embedded 
citations should be used, and there must be a sepa- 
rate list of references at the end of the manuscript. 
Do not use notes for simple citations. These specifi- 
cations are designed to make it easier for reviewers 
to read and evaluate papers. Papers not adhering to 
these guidelines are subject to being rejected without 
review. 

For submission and review purposes, you may place 
footnotes at the bottom of the pages instead of using 
endnotes, and you may locate tables and figures (on 
separate pages and only one to a page) approximately 
where they fall in the text. However, manuscripts ac- 
cepted for publication must be submitted with end- 
notes, and with tables and figures on separate pages at 
the back of the manuscript with standard indications of 
text placement, e.g., [Table 3 about here]. In deciding 
how to format your initial submission, please consider 
the necessity of making these changes if your paper 
is accepted. If your paper is accepted for publication, 
you will also be required to submit camera-ready copy 
of graphs or other types of figures. Instructions will be 
provided. 
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For specific formatting style of citations and refer- 
ences, please refer to articles in the most recent issue 
of the APSR. For unusual style or formatting issues, 
you should consult the latest edition of The Chicago 
Manual of Style. For review purposes, citations and 
references need not be in specific APSR format, 
although some generally accepted format should be 
used, and all citation and reference information should 
be provided. 


Specific Procedures 
Please follow these specific procedures for submission: 


1. You are invited to submit a list of scholars 
who would be appropriate reviewers of your 
manuscript. The Editor will refer to this list 
in selecting reviewers, though there obviously 
can be no guarantee that those you suggest will 
actually be chosen. Do not list anyone who has 
already commented on your paper or an earlier 
version of it, or any of your current or recent 
collaborators, institutional colleagues, mentors, 
students, or close friends. 

2. Submit five copies of manuscripts and a diskette 
or CD containing a pdf file of the anonymous 
version of the manuscript. If you cannot save 
the manuscript as a pdf, just send in the diskette 
or CD with the word-processed version. Please 
ensure that the paper and diskette or CD 
versions you submit are identical; the diskette 
or CD version should be of the anonymous 
copy (see below). Please review all pages of 
all copies to make sure that all copies contain 
all tables, figures, appendices, and bibliography 
mentioned in the manuscript and that all pages 
are legible. Label the diskette or CD clearly 
with the (first) author’s name and the title of 
the manuscript (in abridged form if need be), 
and identify the word processing program and 
operating system. If you are unable to create 
a diskette or CD, please note this in your 
submission, and you will be asked to e-mail the 
appropriate file. 

3. To comply with the APSR’s procedure of 
double-blind peer reviews, only one of the five 
copies submitted should be fully identified as 
to authorship and four should be in anonymous 
format. 

4. For anonymous copies, if it is important to the 
development of the paper that your previous 
publications be cited, please do this in a way that 
does not make the authorship of the submitted 
paper obvious. This is usually most easily 
accomplished by referring to yourself in the 
third person and including normal references 
to the work cited in the list of references. In no 
circumstances should your prior publications be 
included in the bibliography in their normal al- 
phabetical location but with your name deleted. 
Assuming that text references to your previous 
work are in the third person, you should include 
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Are Political Orientations Genetically Transmitted? 


JOHN R. ALFORD Rice University 


CAROLYN L. FUNK Virginia Commonwealth University 


JOHN R. HIBBING University of Nebraska 


e test the possibility that political attitudes and behaviors are the result of both environ- 

mental and genetic factors. Employing standard methodological approaches in behavioral 

genetics—specifically, comparisons of the differential correlations of the attitudes of monozy- 
gotic twins and dizygotic twins—we analyze data drawn from a large sample of twins in the United States, 
supplemented with findings from twins in Australia. The results indicate that genetics plays an important 
role in shaping political attitudes and ideologies but a more modest role in forming party identification, 
as such, they call for finer distinctions in theorizing about the sources of political attitudes. We conclude 
by urging political scientists to incorporate genetic influences, specifically interactions between genetic 
heritability and social environment, into models of political attitude formation. 


hy do people think and act politically in the 
manner they do? Despite the foundational 
nature of this question, answers are unfortu- 
nately incomplete and unnecessarily tentative, largely 
because political scientists do not take seriously the 
possibility of nonenvironmenta! influences. The sug- 
gestion that people could be born with political pre- 
dispositions strikes many as far-fetched, odd, even 
perverse. However, researchers in other disciplines— 
notably behavioral genetics—have uncovered a sub- 
stantial heritable component for many social attitudes 
and behaviors and it seems unlikely that political atti- 
tudes and behaviors are completely immune from such 
forces. In this article, we combine relevant findings in 
behavioral genetics with our own analysis of data on a 
large sample of twins to test the hypothesis that, con- 
trary to the assumptions embedded in political science 
research, political attitudes have genetic as well as en- 
vironmental causes. 

Testing this hypothesis is important for two reasons. 
First and most broadly, as behavioral scientists we need 
to analyze all possible shapers of behavior, not just a 
select few. Second, a more complete understanding of 
the sources of attitudes and behaviors will help us to 
sort through existing puzzles of considerable interest to 
political scientists. One example is political ideology. 
Why is a reasonably standard left-right spectrum so 
widely applicable cross-culturally and over time? The 
universal left-right elements of belief systems around 
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the world and over the decades is difficult for behavio- 
ralists to explain. But if there is a genetic component to 
political ideologies, if the constraints on belief systems 
come not just from intellectualization or indoctrination 
but from something deeper, the concept of ideology 
takes on greater meaning and the commonality of ide- 
ology becomes easier to understand. 


ATTITUDE FORMATION 


Debates concerning the source of political attitudes re- 
volve primarily around the question of whether early 
childhood factors have lasting relevance or whether 
these factors tend to be overwhelmed by more proxi- 
mate events. Survey responses to political items pre- 
sumably reflect attitudes and are thought to be a 
combination of longstanding “predispositions” and 
more recent “off-the-top-of-the-head” considerations 
(Zaller 1992, chaps 1-3; also see Converse 1964). Al- 
ternatively, an “on-line” pattern of processing could 
allow new incidents to ratchet affect one way or an- 
other from previously existing summary locations (see 
Lodge, McGraw, and Stroh 1989). Regardless, proxi- 
mate forces include recent conversations and experi- 
ences, question-wording, priming from previous ques- 
tions, and a variety of similar factors. Predispositions, 
on the other hand, are thought to be a “distillation 
of a person’s lifetime experiences, including childhood 
socialization and direct involvement with the raw in- 
gredients of policy issues” (Zaller 1992, 23).* Great 
interest exists in determining the relative clout of the 
early as opposed to the late environment but no interest 
has been displayed in determining the relative clout of 
environmental as opposed to genetic variables. 

A parallel conclusion applies to research on individ- 
ual attitudes rather than survey responses generally. 
For example, the consensus among those who study 
tolerance is that the extent to which individuals are 
tolerant hinges on a combination of “antecedent con- 
ditions and contemporary information” (Marcus et al. 
1995). Antecedent conditions, in turn, are believed to 


2 To his credit, Zaller (1992) goes on to acknowledge a possible role 
for “inherited” traits in shaping predispositions (23) 
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be shaped by “personal circumstances” such as “family, 
neighborhood, region... and early group experiences” 
(Marcus et al. 1995, 5; for more on the importance of 
long established proclivities, or antecedent conditions, 
see Stouffer 1955). Typically, no role for genetically- 
induced tendencies is considered (for an exception, see 
Monroe 2004, chap 6). 

More broadly, the literature on political socializa- 
tion has long revolved around the question of the ef- 
fects of early as opposed to late environmental forces. 
Early political socialization researchers (e.g., Easton 
and Dennis 1969, Greenstein 1960, Jennings and Niemi 
1968, and Searing, Schwartz, and Lind 1973) and the 
authors of The American Voter (Campbell et al. 1960) 
presented arguments and evidence supporting the pri- 
macy of early events. Later researchers, however, ques- 
tioned the value of early childhood socialization and 
provided evidence that judgments about more recent 
conditions and occurrences can dramatically alter pref- 
erences we might have held as children and adoles- 
cents (see, e.g., Fiorina 1980; for good summaries of the 
debate over the relative importance of early and late 
environmental events, see Cook 1985; Merelman 1986, 
and Sears 1989). In the last 50—60 years, the empha- 
sis in the literature has gone from personality studies 
(Adorno et al. 1950; Eysenck 1954; Laswell 1930), to 
ideological and childhood socialization studies, to the 
effects of media frames, perceptions of current condi- 
tions, and other types of contemporary information. In 
fact, for the past couple of decades research on polit- 
ical socialization has been suffermg through a “bear 
market” (Cook 1985), and studies of personality, while 
experiencing a remarkable comeback in psychology 
(for an introduction, see Wiggins and Trapnell 1997), 
have been largely absent from political science since 
McCloskey’s (1958) work in the 1950s on the con- 
servative personality. Thus, political science debates 
concerning the source of political attitudes and be- 
haviors have been over timing, over whether attitudes 
and behaviors are primarily shaped early in life or by 
more proximate occurrences. Conspicuously absent is 
consideration of the possibility that certain attitudes 
and behaviors may be at least partially attributable to 
genetic factors. 


MODERN BEHAVIORAL GENETICS 


But what is the physical process by which a genetic 
allele could shape a political attitude? If there is any 
connection at all, is it not that the effect is so small 
that it can be safely ignored? And even if this 1s not 
the case, in light of potentially troubling normative 
implications such as biological determinism, is it not 
best to ignore relationships between genes and social 
behavior? It is difficult for many outside the biological 
sciences to understand how it is even possible for genes 
to influence behavior, so a brief discussion is in order. 
Genes provide instructions for the production of pro- 
teins, which are built and identified by a specific com- 
bination of amino acids (which in turn are constructed 
from complex organic molecules). As such, each pro- 
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tein has a chemical sequence that then interacts with 
other chemicals in the body, sometimes reacting di- 
rectly with these other chemicals but often serving as 
enzymes that facilitate but are not themselves altered 
by chemical reactions. If a gene coding for a particular 
enzyme is absent, the chemical reaction it is meant to 
enhance will occur with much less efficiency. For exam- 
ple, a gene for the enzyme tryptophan hydroxylase-2 
(Tph2) facilitates production of the neurotransmitter 
serotonin in the brain, but a certain form of this gene 
(which varies from the standard form by a single amino 
acid) produces about 80% less serotonin and people 
with this mutant allele appear to be significantly more 
likely to suffer from unipolar depression (Zhang et al. 
2005). 

Still, the connection is rarely so simple that a given 
genetic allele can be seen as causing a certain behavior. 
More typically, findings in modern behavioral genetics 
reveal the effect of genes to be interactive rather than 
direct, let alone determinative. To provide one illus- 
tration, in humans there is a gene on chromosome 17 
involved with serotonin reuptake (5-HTT). As is of- 
ten the case with genes, 5-HTT has a long allele and 
a short allele. Mice have a parallel gene, and in that 
species the short form had previously been connected 
to listless, depressive behavior. Scientists were eager to 
determine if such a correlation between the short form 
of 5-HTT and depression was present in humans. In a 
long-term study of the health records of nearly 1,000 
New Zealanders whose 5-HTT alleles were known, it 
was found that major episodes of depressive behavior 
were not much more prevalent among those with the 
short form. But then the researchers combined genetics 
and the environment; specifically, they interacted each 
subject’s 5-HTT allele with the number of high-stress 
events (romantic calamities, bankruptcies, deaths of 
loved ones, etc.) experienced in that individual’s life. 
They found that those who had a high number of such 
events and who had the short form of 5-HIT were 
significantly more likely to display behaviors associated 
with depression compared to either those experiencing 
few high stress events or those with the long form who 
suffered through a comparably large number of high- 
stress events (see Caspi et al. 2003). 

In this particular case, genotype did not make peo- 
ple behave a certain way; rather, it influenced the ex- 
tent to which their behavior was contingent on the 
environment—and this pattern likely will apply to all 
sorts of other human activities. Whether the behavior 
of interest is depression, cooperation, fear response, or 
susceptibility to drug addiction, some people are more 
sensitive than others to particular features of their en- 
vironment, and genetics, far from determining behav- 
ior, influences its sensitivity. Genetics makes the mood 
of some people far more dependent on the extent to 
which their lives have been beset with difficulties and it 
likely makes some people’s political attitudes far more 
contextually dependent than others. In other words, 
the connection between genes and attitudes may not 
involve specific attitudes as much as the flexibility of 
those attitudes (Is abortion always wrong, or does it 
depend?). The issue is not nature versus nurture but 
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the manner in which nature interacts with nurture (see 
Marcus 2004 and Ridley 2003). 


MONOZYGOTIC AND DIZYGOTIC TWINS 


The process of identifying in the laboratory the pre- 
cise genes responsible for given human behaviors 
(especially those behaviors that do not have corollar- 
ies in lab-friendly animals such as mice) is extremely 
challenging. Fortunately, even without identifying the 
genes responsible, it is possible to compile informa- 
tion on the matter of most concern to social scien- 
tists: the extent to which attitudes and behaviors have 
a genetic component. The relevant procedures center 
on comparisons of monozygotic (MZ; frequently but 
erroneously called identical) twins and dizygotic (DZ; 
fraternal) twins. 

MZ, twins develop from a single egg, fertilized by 
a single sperm, and share an identical genetic inheri- 
tance. DZ twins develop from two separate eggs, fer- 
tilized by two separate sperm, and are in effect simply 
two siblings that happen to be born simultaneously. As 
such, DZ twins share the same average of 50% of ge- 
netic material as do any two biological siblings. It is this 
fixed ratio (two to one) of genetic similarity between 
MZ and DZ twins, and the contrasting average equiv- 
alence of environment influence, that provides most of 
the power of twin designs. It is important to appreciate 
that the assumption of environmental equivalence is 
one of equivalence across types of twins, not across 
pairs of twins or across twins within a given pair. For 
example, there is undoubtedly at least some variability 
in parental socialization across siblings, even those of 
identical age, but across multiple twin pairs the assump- 
tion is that this variability is essentially equal for the 
MZ and the DZ pairs. 

This assertion that the effect of genetics is measur- 
ably distinct for MZ and DZ twins, while the effect of 
the environment is either equivalent or at least ran- 
domly distributed around equivalence, is crucial to ev- 
erything that follows from twin research. It is important 
therefore to raise and consider the criticisms of this 
fundamental assumption. The arguments come in two 
essential varieties. The first is that MZ twins, genetics 
aside, experience a more similar environment because 
they are treated more similarly than are DZ twins. This 
would seem particularly telling for childhood socializa- 
tion, where, for example, parents might show less of a 
tendency to treat MZ twins as individuals compared 
to DZ twins. The second is that MZ twins, genetics 
aside, interact with each other more throughout life 
than do DZ twins. This would seem to be of particular 
importance for adult socialization, where closer adult 
contact between MZ twins might lead us to expect a 
greater degree of environmentally induced similarity 
than we would see for the more distant DZ twins. 

Both caveats have been subject to sustained and var- 
ied investigation and neither has been found to hold up 
under empirical scrutiny. The argument of more similar 
treatment fails on several fronts. Parents frequently 
miscategorize their twins (DZ twins are often believed 
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by their parents to be MZ twins) and the differential 
correlation persists in these instances of miscategoriza- 
tion. In other words, the degree of correspondence be- 
tween MZ twins surpasses that of DZ twins even in the 
large subpopulation of twins thought by their parents 
to be MZ twins (Bouchard and McGue 2003; Bouchard 
et al. 1990; Plomin 1990). The contention that MZ 
twins have closer or more frequent contact than DZ 
twins turns out to be at best irrelevant. The correla- 
tion between the frequency of contact between twins 
and the similarity between twins on all attitudinal and 
behavioral variables tested, including conservatism, is 
slight and actually negative (Martin et al. 1986). In 
other words, twins in greater contact with their cotwins 
are not more likely to share the same attitudes and 
behaviors, so even if MZ twins have more contact than 
DZ twins, this contact is not the cause of any elevated 
correlations. But the most powerful refutation of both 
of these criticisms comes in recent studies utilizing MZ, 
and DZ twins raised apart. These studies uniformly val- 
idate MZ and DZ differences found in earlier studies 
of twins raised together. Arguments about the relative 
degree of shared environmental effects between MZ 
and DZ twins simply offer no credible explanation if 
the twins in question have been raised apart (Bouchard 
1998; Bouchard et al 1990). In effect, this naturally 
occurring, if uncommon, condition provides precisely 
the sort of laboratory control that we would want in an 
experimental setting.’ 

Other evidence against the exclusive environmental 
argument is that the empirical results suggest MZ twins 
reared together are often less likely to share behavioral 
traits with their twins than are MZ twins reared apart, 
presumably because of extra efforts to establish distinct 
identities when the twins live together. In addition, as 
adult MZ twins living apart age, they tend to become 
more, not less, similar (Bouchard and McGue 2003), a 
finding that is difficult to reconcile with the belief that 
only the environment matters. Interestingly, this pre- 
cise effect is predicted in an early landmark criticism of 
behaviorism and the conditioned response research on 
animal behavior that formed its empirical core. Over 
time, substantial anomalies began to accumulate in this 
research pointing toward a primacy for some nonen- 
vironmental behaviors. Breland and Breland (1961) 
summarized this tendency with the phrase “learned 
behavior drifts toward instinctive behavior” (684). 

Given the genetic differences and environmental 
similarities of the two types of twins, for any trait 
that is partly heritable the tendency for MZ twins to 
share that characteristic should be stronger than the 
tendency for DZ twins to share that characteristic. 
In contrast, characteristics that arise purely from the 
environment, whether shared by the twins, as would 
typically be the case for parental socialization, or not 


3 To explain this finding, opponents would need to argue that adop- 


tion agencies are more likely to place MZ twins in similar homes 
than they are to place DZ twins in similar homes. In fact, information 
on twin zygosity 1s typically unavailable to those making placement 
dectsions, and even if ıt were available, it seems highly unlikely that 
it would factor mto therr decisions. 
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shared by the twins, as would be the case for many 
adult experiences, should not generate any significantly 
different patterns when we contrast MZ and DZ twins 
(see Eaves, Eysenck, and Martin 1989 and Plomin et al. 
2001 for a thorough discussion of the relevant statistical 
techniques). 

The procedures involved with the twin methodol- 
ogy are standard fare in behavioral genetics but are 
not familiar to most political scientists, so it is appro- 
priate that we explain the basic terminology, theory, 
and technique 1n some detail. Influences on an individ- 
ual trait, whether it is a political attitude or a physi- 
cal characteristic, are typically divided into two broad 
groups—heredity (H) and environment (E). The total 
variation in a trait can thus be represented as the sum 
H+ E. Heredity is the impact of genetic inheritance on 
trait variation. In the case of a physical characteristic 
such as adult height, this would be the proportion of the 
total variation in height across individuals due to the 
variation across individuals in the multiple genes that 
control ultimate physical height. For any one individ- 
ual, the source of this genetic influence is relatively well 
defined, as on average 50% of our genes come from our 
mother and 50% come from our father. This leads to 
the fact that biological children of tall parents are more 
likely to be tall than are the biological children of short 
parents, though even for a relatively straightforward 
additive physical trait like height, the relationship is 
far from determinative. 

“Environment” is all of the nongenetic external fac- 
tors that influence trait variation across a population. 
These influences range broadly from the earliest bio- 
logical environment of the womb, to the physical en- 
vironment of a childhood house, to the social environ- 
ment of the adult workplace. In the case of adult height, 
some of the obvious environmental factors are prenatal 
nutrition, the adequacy of childhood and adolescent 
diet, and exposure to chemical agents that can inhibit 
growth. 

Environmental influences can be further divided into 
two subcategories: the shared environment and the 
unshared, or unique, environment. The shared envi- 
ronment is all of the shared external influences that 
we would typically think of as leading to trait similar- 
ity between individuals. Siblings, for example, might 
share similar childhood environments, including sim- 
ilar parental interactions, a similar physical environ- 
ment, and similar nutrition. If the siblings happen to 
be twins, they would also share a more similar prenatal 
environment.‘ In the case of adult height, a shared 
environmental factor, such as a regional diet limited in 
protein and specific nutrients, could lead to similarity 
in height across the entire population of a region. 

The unshared environment is all of the distinctive 
external influences that we would typically think of as 


4 However, recent research suggests that the prenatal environment 
is so important that it can cause variation even m fetuses inhab- 
iting the uterus at the same time. Prescott, Johnson, and McArdle 
(1999) present evidence that MZ twins sharing the same chorion, the 
outermost extraembryonic membrane, are more similar m terms of 
personality and cognitive abilities that MZ twins m separate chorions 


156 


May 2005 


leading to trait dissimilarity across individuals. While 
much of the early childhood environment, for example, 
is similar across siblings, much is nonetheless variable. 
Siblings differ in diet, disease exposure, peer influences, 
and a host of other unique experiences. Even twins, 
whose childhood environment is made increasingly 
similar by virtue of their identical age, are exposed to 
substantial unique external influences. With the shift to 
adult life, the share of unique influences on siblings in- 
creases sharply, as peer, workplace, family, and physical 
settings typically diverge. 

In the classic political science studies of socializa- 
tion (see, especially, Jennings and Niemi 1968, 1991 
and Tedin 1974), the focus has been on the correlation 
between the attitudes of parents and their children. In 
terms of the three sources of trait variability outlined 
above, as informative as it is, this design does not allow 
for an unambiguous estimation of any of the three cat- 
egories. The correlation between a parent and a child 
arises from a combination of shared genes, shared en- 
vironment, and parental socialization (an indirect form 
of shared environment in which the parent’s attitudes 
provide a path from the parent’s environment to the 
child’s environment), all of which are pressures toward 
similarity in parent-child attitudes. The failure of this 
parent-child correspondence to reach +1.0 presum- 
ably reflects the pressure toward dissimilarity coming 
from the unshared environment, but since the genetic 
similarity of a parent—offspring pair is only .5, there 
is as much genetic dissimilarity as there is similarity. 
Thus, trait dissimilarity, like trait similarity, is an unde- 
termined mixture of genetic and environmental influ- 
ences. Our inability to tease apart genetic heritability 
and environment, whether shared or unshared, in these 
parent-child studies is a direct result of the fact that 
there is no measured variation in genetic similarity 
across the data set of parent-child pairs (ie., all bi- 
ological offspring share the same average of 50% of 
the variable genetic code with each parent). 

This inability of standard parent-child observations 
to distinguish genetic heritability from parental social- 
ization (or other features of the shared environment) is 
something that has long been understood, but largely 
ignored in modern social science. Fortunately, twins 
provide a powerful “natural experiment” by introduc- 
ing known genetic variation into analyses of the sources 
of trait variability. By shifting the focus from the sim- 
ilarity between parents and offspring to the similarity 
between two siblings, we can take advantage of the 
fact that some siblings vary in well-known ways in the 
degree of their genetic correlation. 


POLITICS AND GENETICS: PREVIOUS 
FINDINGS AND OUR EXPECTATIONS 


Comparisons of the correlations of MZ and DZ twins 
on a wide variety of variables have been conducted, 
with intriguing results. Using appropriate modeling 
techniques including controls for parental traits and 
assortative mating, it is possible to partition the ex- 
planatory powers of heredity, shared environment, and 
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nonshared environment on any given variable. These 
techniques have been valuable for epidemiological 
traits, intelligence, personality, social attitudes such as 
those connected to religion, psychological interests, 
and behaviors such as risk-taking propensities (for a 
thorough review, see Bouchard and McGue 2003). Of 
most interest to us are the findings pertaining to social 
attitudes and behaviors. At first, researchers were so 
confident that social attitudes were not heritable that 
they employed such items as controls. Quickly they 
discovered that other controls would have to be found 
because most social attitudes consistently displayed 
a surprising measure of heritability (see, e.g., Crelia 
and Tesser 1996, Scarr and Weinberg 1981, and Tesser 
1993). 

Political attitudes were never a central focus in this 
research stream but many of the patterns found in other 
social attitudes should be present for political attitudes 
as well, and this assumption guided the formulation 
of our expectations. Since the social attitudes tested 
to date have demonstrated a strong heritable compo- 
nent, frequently stronger than attitude covariance at- 
tributable to shared environment, we predict that polit- 
ical attitudes will also be heavily heritable. Heritability 
estimates calculated by previous researchers for atti- 
tudes associated with psychological conservatism are 
quite high, while the relevant models typically show 
little or no effect for shared environment (the remain- 
der is likely the result of nonshared environmental 
factors). Notably, these findings come from studies of 
twins in settings as disparate as Australia, Virginia, and 
Minnesota, and the findings of the Minnesota study, 
utilizing twins reared apart, conform well to the 
other studies of twins raised together (for a summary, 
see Bouchard and McGue 2003).° Careful studies of 
adopted children confirm the finding that genetics mat- 
ter more than parentally created environment in influ- 
encing social attitudes and behaviors, personality traits, 
and intelligence.® 

We further predict that attitudes on political is- 
sues tracking most closely to central personality traits 
should be the most heritable since personality traits 
are generally heritable and since the heritability of so- 
cial attitudes is likely derivative of the heritability of 
various personality traits (see Bouchard and Loehlin 


3 Conservatism is not unusual in this regard. Rushton, Littlefield, and 
Lumsden (1986, 7340) find that approximately 50% of the vanance 
in altruism is the result of “direct genetic inheritance,” with family 
environment responsible for 0%. 

6 Adoption studies measure the correlation of biological parents and 
adopted children where the biological parents have had no contribu- 
tion to the rearing (environment) of the child. The most recent adop- 
tion study, utilizing surveys of Korean-American adoptees randomly 
assigned to families in the United States, concludes that roughly 75% 
of variance in children’s educational attainment is attributable to the 
educational attainment of their biological parents, and only 25% is 
attributable to the adoptive parents, thus dramatically 

the earlier findings of a substantial correlation between biological 
parents and adopted children and a surprisingly paltry correlation 
between adoptive parents and children (Sacerdote 2004) This par- 

allels, with an entirely distinct methodology, the basic finding of the 
twin studies (see Plomin et al. 1997, 1998 and Rhee and Waldman 
2002) 
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2001 and Eaves, Eysenck, and Martin 1989). For ex- 
ample, one of psychology’s “Big 5” personality traits is 
general “openness” and it seems likely degree of open- 
ness is relevant to the political arena as well. Liberals 
and conservatives, on average, differ in their openness 
to atheism, homosexuality, communism, immigration, 
and countercultural activities. These differences may 
be entirely due to enculturation, but then again, they 
may not be, and we will never know without testing for 
the effects of genetics. 

Based on behavioral geneticists’ study of religion, 
it seems that group identification is something that 
is heavily influenced by the environment, especially 
shared environment, and is mostly unconnected to ge- 
netics. Children of Methodists are likely to be Metho- 
dists not because there is a gene for Methodism or even 
a personality particularly oriented toward Methodism, 
but because of parental socialization. Thus, even as atti- 
tudes connected to religiosity and religious beliefs and 
activities (e.g., Sabbath observance, church authority, 
belief in heaven, religious fundamentalism, frequency 
of attendance) were found to be shaped more by ge- 
netic inheritance than by parental views on those issues 
(for details, see Bouchard et al. 1999, Eaves, Martin, 
and Heath 1990, Maes et al. 1999, and Martin et al. 
1999), identification with a particular religious group 
was shaped more by socialization and almost not at 
all by genetics. We expect to find a similar pattern 
with political party identification. Children are eager 
to belong to the groups their parents belong to and 
parents are frequently eager to encourage children in 
this regard. Assuming these identifications have some 
stickiness into early adulthood, our core expectation 
is that party identification will be influenced more by 
parental socialization (shared environment) than by 
genetic inheritance but that this pattern will be re- 
versed for political attitudes with inheritance playing 
a role at least as large as the shared environment. By 
predicting a large influence for genetic inheritance, we 
depart from typical behavioralist expectations antic- 
ipating that political attitudes will be predominantly 
influenced by environmental factors, rendering genetic 
inheritance largely, if not completely, inconsequential. 


DATA AND METHODS 


Since twin studies have not been conducted by po- 
litical scientists, political attitudes have been at best 
a sidelight, and properly refined measures of political 
variables have not been constructed and employed (the 
heritability of political behavior has not been analyzed 
at all). Nonetheless, some previously employed vari- 
ables in twin studies have political relevance. For ex- 
ample, the heritability of conservatism is frequently as- 
sessed (see, e.g., Bouchard et al. 1990, Eaves, Eysenck, 
and Martin, 1989, and Martin et al. 1986), and even 
though conservatism is viewed by the scholars who 
do twin studies more as a psychological trait than a 
political ideology, measures of it include political items. 

Of most relevance here is the Wilson—Patterson 
(W-P) Attitude Inventory. This inventory is admin- 
istered by presenting subjects with a short stimulus 
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phrase such as death penalty or royalty and eliciting 
a simple agree, disagree, or uncertain response. The 
broadest version of the W-P inventory includes 50 
items, 25 of which contribute positively to the con- 
servatism score and 25 of which contribute negatively 
to the conservatism score. While some of the items re- 
late to a heavily social conception of conservatism—for 
example, pajama parties, nudist camps, computer mu- 
sic, and horoscopes—others have a much more direct 
political content—for example, disarmament, social- 
ism, patriotism, and death penalty. Studies typically 
utilize reduced sets of W-P items or modify individual 
items to better suit the country in which the items are 
being administered. For political science this presents 
two frustrations. The list of politically relevant items 
is tantalizing but limited and unfocused, and the re- 
sults are often presented only for the entire combined 
scale, making it difficult to assess the contribution 
of the directly political items to the overall index of 
heritability. 

We were granted access to the data for the W-P items 
in the United States and were able to conduct compa- 
rable, though more limited, twin correlation analyses 
from published results of an Australian study.’ The U.S. 
study included information on thousands of twin pairs 
in Virginia, supplemented with twin pairs recruited 
through the cooperation of AARP. A subset of these 
twins and their relatives has been asked questions re- 
garding their social attitudes, including numerous items 
from the W-P inventory. 

A brief explication of twin methodology should help 
readers make independent sense of the tables. The 
standard techniques in behavioral genetics are based 
on correlation analysis (in the case of limited response 
items like the W-P inventory, the actual measure is the 
polychoric correlation coefficient, a technique that is 
appropriate when individual subjects are using a lim- 
ited set of categories to express location on what is in 


7 Our thanks go to Professor Lmdon Eaves at Virginia Common- 
wealth University for making the VA30K data available to us The 
data collection methods for both studies are summarized ın Lake 
et al. 2000 as follows: “The Australian sample was ascertained 
through two cohorts of twins. The first cohort was recruited in 1980- 
1982 from a sampling frame which comprised 5967 twin pairs aged 
18 years or older (born 1893 to 1964) then enrolled on the Aus- 
tralian NHMRC Twin Registry (ATR). Responses were obtamed 
from 3808 complete pairs... and these were followed up with a sec- 
ond mailed questionnaire in 1988-1990 with responses from 2708 
complete pairs.. . The second cohort of twins, born 1964-1971, was 
recruited from the ATR in 1989 and was mailed similar question- 
naires in 1989-1991, with responses from 3,769 individuals of 4269 
eligible pairs.... In total there were 21,222 respondents in the Aus- 
trahan sample, of whom 20,945 had valid scores for EPQ Neurotl- 
cism. The United States twins were ascertained from a population- 
based birth registry for the Commonwealth of Virginia and from 
a volunteer sample through the American Association of Retred 
Persons (AARP), described in detail by Truett et aL (1994). Thear 
first-degree relatives and spouses were recruited m a similar fashion 
to the Australian sample, and in total there were 24,905 respondents 
(of 29,080) with valid scores for Neuroticism and for whom the zy- 
gosity of the proband twms could be determined. The response rates 
were 70% for twins and 45% for relatives” (224-25). The ongmal 
U.S. twin data collection was funded in part by NIH grants GM30250 
and AG04954, by ADAMHA grants AA06781, AA07728, AA07535, 
and MH40828, and by a gift from R J R Nabisco. 
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fact a continuous trait). The correlations are computed 
separately for male/male and female/female twin pairs 
to provide an appropriate comparison, since all MZ 
twins are same-sex pairs, while DZ twins are a mix 
of same-sex and opposite-sex pairs (in other words, 
female/male DZ twin pairs are excluded from the anal- 
ysis). Without this control, the presence of any male/ 
female differences would spuriously deflate the cor- 
relations for DZ pairs relative to the same-sex MZ 
pairs. 

Heritability is typically estimated by subtracting the 
correlation for DZ pairs from the correlation for MZ 
pairs and then doubling the resulting difference. At one 
extreme, if the correlations are the same for MZ and 
DZ pairs, suggesting that genetic similarity plays no 
role in similarity for that particular trait, then the result 
will be an estimate of heritability of zero. At the other 
extreme, a purely genetic additive trait should produce 
a correlation of .5 for DZ pairs and 1.0 for MZ pairs, re- 
sulting in an estimate of heritability of 1.0 (1.0 — .5=.5, 
and 2 *.5= 1.0). In a similar way, we can estimate the 
influence of shared environment, as opposed to shared 
genetic material, by doubling the correlation for DZ 
pairs and then subtracting the correlation for MZ pairs. 
Again, a purely genetic additive trait should produce 
a correlation of .5 for DZ pairs and 1.0 for MZ pairs, 
resulting in an estimate of the impact of shared en- 
vironment of zero (2*.5=1.0, and 1.0—1.0=0). At 
the other extreme, if the correlations are the same for 
MZ and DZ pairs, suggesting that genetic similarity 
plays no role in similarity for that particular trait, then 
the result will be an estimate of the impact of shared 
environment that is equal to the MZ or DZ correlation 
(e.g., if MZ = DZ = .4, then 2*.4=.8, and .8 — .4 = .4). 
Whatever is left over is taken to be attributable to the 
unshared environment. 


THE HERITABILITY OF POLITICAL 
ATTITUDES 


Table 1 contains the results of a standard polychoric 
correlation analysis for the 28 W-P items available in 
the Virginia 30K data set and for a select set of addi- 
tional items to provide some sense of perspective for 
the level of these correlations. Even the quickest glance 
at the results in Table 1 is enough to set aside the tradi- 
tional view that genes do not play any role in explaining 
political attitudes. All 28 of the MZ correlations are 
larger than their corresponding DZ correlations, and 
in every case the difference is statistically significant at 
the .01 level. Far from typically being at or near zero, 
none of the 28 heritability estimates falls in the single 
digit range, and more than half of the 28 items have 
heritability estimates of .3 or more. Heritability ranges 
from a high of .41 to a low of .18, all suggesting that the 
influence of heredity on political attitudes is very real, 
and given the diverse range of items included here, 
this genetic influence is also pervasive. So the view 
that heritability of social and political attitudes will be 
nonzero but small relative to shared environment is 
also called into question. We see from Table 1 that the 
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TABLE 1. 
Wllson—Patterson Items 


Genetic and Environmental Influences on Political Attitudes: The 28 Individual 


Polychoric Correlation 


MZ DZ 


re ae 
NN 
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SBANN 


Jesixe 


Women’s Liberation 
Death Penalty 
Censorship 
Living Together 
Miltary Drill 
Gay Rights 
Segregation 
Busing 

Nuclear Power 
Democrats 
Divorce 


a4 
— Ovo 


SELLE: 


Shared Unshared 


0.25 0.34 9.83 
0.06 0.53 7.66 
0.03 0.58 
0.14 0.47 

0.52 


Abortion 
Modem Art 
Federal Housing 
Liberals 


0.25 
0.25 
0.20 
0.18 
0.32 


pnncipal investigators, Virginia 30K twin study (see note 7). 
difference ts statistically significant for all of the table tems at the 0.01 level or above. 


SFIERE 


{ 
1 
1 
1 
1 
1 
1 
1 
1 
1 
1 
1,77 
1 
1 
1 
1 
1 
1 
i 
1 
1 
1 
1 
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impact of shared environment exceeds that of heredity 
for only four of the 28 items, and the mean estimate of 
heritability for the 28 W-P items is .32, compared to 
a mean estimate of shared environmental influence of 
.16. 

The second-to-last column in Table 1 reports the esti- 
mates for the proportion of the variation in an attitude 
that is attributable to the unshared environment. As 
described above this is essentially a residual variance 
category, reflecting such factors as random choice as 
well as external influences such as the unique expe- 
rience of each individual, including those from child- 
hood, and later influences in life that have been termed 
“adult socialization” in the political science literature. 
Across the 28 W-P items the estimate of the impact 
of unshared environment varies from about one-third 
(for School Prayer) up to about two-thirds (for Paci- 
fism) of the overall variation. The average impact of the 
unshared environment for these items is .53, or roughly 
half of the overall variation. The summary picture for 
this set of political attitudes, then, is that shared influ- 
ences (genetic and environmental) account for about 
half of the variation in these political reactions, with 
unique individual and environmental factors account- 





ing for the remainder. Within the half that is accounted 
for by shared influences, genetic influences, in contra- 
diction to behavioralist expectations, are roughly twice 
as influential as environmental influences. 

While the individual items provide interesting vari- 
ation, the purpose of the W-P inventory is to provide 
an overall index of conservatism. We compute a simple 
index by assigning a value of +1 to any “conservative” 
response (i.e., a “yes” to an item like Death Penalty 
or a “no” to an item like Women’s Liberation) and —1 
to any “liberal” response (i.e., a “no” to an item like 
Death Penalty or a “yes” to an item like Women’s 
Liberation). Items where the respondent chose a non- 
commital (?) response are coded as zero. When these 
individual scores are summed across the 28 items they 
yield an index that varies from a potential low of —28 
(indicating a set of uniformly “liberal” responses) to 
a high of +28 (indicating a set of uniformly “conser- 
vative” responses). The actual index scores for the 
twins in the study range from —26 to +26, with the 
median response falling between +2 and +3. Given 
the far more continuous nature of this overall index, 
we can now utilize the more traditional Pearson’s cor- 
relation coefficient. The results for the overall index 
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TABLE 2. Genetic and Environmental Influences on Polltical Attttudes: Summary Index and 
Additional Non—Wilson—Patterson Items 


ae la ca a a ŘŮĖŮĖ 


Correlation 


DZ Shared 
Heritabilrty, 


Unshared 
Environment, Environment, zfor (MZ—DZ) 


Atttude Hem f n 1— MZ Difference’ 


Pearson’s correlation coeffictent 
28-rtem Index score 


2%(MZ—DZ) (2*DZ)—MZ 


893 
3.08 


1,384 0.43 0.35 
131 l 0.36 


Oplnionation 
Polychoric correlaton coefficient 
28-iItem mean 
Educational Attalnment 
Party Affillatton 
Mean of affect toward Reps 


1,384 


1,748 
1,771 
1,554 
1,730 


and Dems. 


O WA UWB o o I Im Im M IU I M ammm ammm 
Source Access to the data provided by Eaves et al , pnncipal investigators, Virginia 30K twin study (see note 7) 
“The MZ—DZ correlation difference ls statistcally significant for all of the table Items at the 0.01 level or above 


are presented in Table 2 and clearly support a powerful 
role for heredity in influencing conservatism, at least 
as measured by the W-P inventory. The estimate for 
heritability is .43, higher than for any of the individ- 
ual items. The estimate for shared environment is .22, 
falling within the upper range of the individual items, 
while the estimate for unshared environment is only 
35, falling very near the bottom of the range for indi- 
vidual items. The overall picture is again a very strong 
role for heredity and a less powerful, but clear role for 
shared environment. What is different for the overall 
index is that the role of shared influences (genetic and 
environmental) account for almost two-thirds of the 
variation in the index (compared to about one-half 
for the individual items), with unique individual and 
environmental factors accounting for only about one- 
third of the variation. This decline in the role of unique 
individual and environmental factors seems sensible, 
as we are moving from individual and highly specific 
items that could involve a host of unique experiential, 
associational, and informational perturbations to an 
index where those idiosyncratic features of individual 
items have the opportunity to cancel each other out. 
The W-P items can also be used to construct a rough 
index of political opinionation by taking advantage of 
the frequency of ? responses. The number of times 
that a respondent chose a yes or no response Over a 
neutral ? response was summed to produce an index 
that varies from zero to 28, with a 28 indicating that the 
respondent was willing to express a directional opinion 
on all 28 items and a score of zero indicating that the 
respondent was unwilling to offer a directional opin- 
ion on any of the 28 items. The median for this index 
is 21 yes or no response choices of 28 possible. ‘The 
results for the overall index clearly support a power- 
ful role for heredity in influencing political opiniona- 
tion, at least as it is captured by the admitted rough 
gauge of the frequency of nonneutral responses to the 
W-P inventory items. The estimate for heritability is 
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36 and the estimate for shared environment is only 
02. The estimate for unshared environment is high, 
at .61, falling near the top of the range for individual 
items. To the extent that there is a family effect on 
political opinionation, it would appear to be entirely a 
genetic one, with the remaining roughly two-thirds of 
the variation being due to nonshared factors. 

Two items from the survey that are not a part of the 
W-P inventory are included in Table 2. Party affiliation 
is the most clearly political of the items in the broader 
questionnaire, and it is useful here on its own, as well 
as in contrast to the attitudinal items. Party identifi- 
cation is distinct among U.S. political attitudes both 
in our conception of it as an identification, and hence 
as something at least potentially distinct from simple 
item evaluation, and in its established tendency to cor- 
relate well between parent and child (see Jennings and 
Niemi 1968). This distinctiveness is apparent in Table 2. 
As we expected, the pattern for party identification is 
nearly the exact reverse of that for the average attitude 
item. Heritability for party affiliation is relatively low 
(r=.14), while shared environment is much stronger 
(r=.41). Note also that not one of the 28 W-P items 
has an average heritability that is as low as that for party 
affiliation, and likewise, not one of the 28 items has an 
average coefficient for the impact of shared environ- 
ment that is as high as that for party affiliation. Clearly, 
party identification is, at least for the United States, a 
different sort of beast than reactions to issue items. 

In this regard it is particularly interesting that the 
two major parties also appear in the W-P battery, but 
here they are objects of affect rather than labels of pos- 
sible identification, and the “pro” or “con” reactions to 
the parties that these items pick up do not exhibit the 
same patterns of genetic and environmental influence 
that we see for party affiliation. In fact, if we average 
the polychoric correlation for the “Democrats” item 
with the correlations for the “Republicans” item and 
compute the resulting estimates we get a heritability 
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estimate of 31 and a shared environment estimate of 
.17, almost exactly the same as the mean results for all 
28 attitude items. It would appear that affect toward 
the major parties is largely a matter of genetic predis- 
position but that, just as the political socialization liter- 
ature has concluded all along, party identification itself 
is primarily the result of parental socialization. This 
pattern is intriguing in and of itself but it also should 
give pause to those who would dismiss the findings 
on attitude items as the product of some methodolog- 
ical quirk of twin studies. If estimates of heritability 
are somehow artificially inflated, why does this alleged 
contamination not occur for party identification? 

Table 2 also reports the results for a summary 
indicator of educational attainment from the survey. 
We include it here partly because it reflects an actual 
behavior, if only a self-reported one, and partly be- 
cause it carries the role of genetics more directly into 
the world of actual and meaningful social variation. 
Educational attainment is also useful as an example 
of a behavior that is traditionally thought to be heav- 
ily influenced by shared environment, particularly by 
parental example, expectations, and resources. This tra- 
ditional view is supported by the shared environment 
estimate of .46, a figure higher than any of the estimates 
for the 28 attitude items and even somewhat higher 
than the estimate for party ID. What may surprise 
readers is that as important as shared environment is 
to educational attainment, heredity, at .40, is almost as 
important. Taken together, family effects are almost 
the entire story for variation in education attainment. 
The estimate for the impact of the unshared environ- 
ment is only .14, a value markedly lower than any other 
in the table. 


ASSORTATIVE MATING 


Assortative mating is a particular concern here. As de- 
tailed above, the assumption that DZ twins, like any 
other pair of biological siblings, share on average 50% 
of the variable genetic code is crucial to the estimation 
of heritability. This contrasts with MZ twins, where the 
shared proportion is 100%, and the DZ level forms 
the baseline for separating genetics from the shared 
environment. What may not be immediately apparent 
is that the assumption that purely genetic traits in DZ 
twins will on average correlate at .50 is itself built on the 
assumption that their biological parents will on average 
correlate at .00 for the same traits. In other words, the 
assumption is that the parents are not related to each 
other in any close degree, and this is typically true, as 
close relatives generally do not mate, and the amount 
of average shared genetic code drops geometrically as 
we move away in relatedness and quickly approaches 
zero. This assumption that mates are genetically uncor- 
related on the trait of interest is, however, violated if 
mate choice is itself based on the trait of interest. If, 
for example, parents have identical genetic codes for a 
trait of interest, then the shuffling of that genetic code 
produced by sexual reproduction will not result in any 
variation among DZ twins, or any other siblings, with 
regard to their genotype for that trait. In other words, 


DZ twins of these parents will be as genetically alike 
on this one trait as MZ twins are on this trait. Across a 
study population, the higher the proportion of spouses 
that share identical genetics for a trait, the closer the 
DZ correlation will be to the MZ correlation. Since 
heritability of a trait is estimated as 2 * (MZ — DZ), the 
increased similarity of DZ and MZ pairs will lead to an 
underestimation of heritability for this genetic trait. 

This is important for our assessment of the heritabil- 
ity of political attitudes. If there is a tendency for people 
to choose mates with similar positions on political is- 
sues, then the estimates of heritability in Tables 1 and 2 
are biased. Fortunately for us, the direction of the bias 
is uniform and conservative. Any measurable tendency 
toward assortative mating on political orientation will 
push up the DZ twin correlation while leaving the 
MZ correlation unaffected, and this reduction in the 
MZ-DZ gap will have the related effect of lowering 
estimates of heritability. Note also that any increase in 
similarity of DZ twins will inflate the estimate of the 
importance of shared environment, as the estimation 
formula of (2 x DZ) — MZ makes clear. 

The immediate empirical question is how much of 
a role assortative mating plays in political issue posi- 
tions. A quick answer can be found by looking at the 
interspouse polychoric correlations for the individuals 
included in the Virginia 30K study. The average inter- 
spouse polychoric correlation for the 28 items is .41 and 
the individual correlations range from a low of .26 for 
Censorship to a high of .64 for School Prayer. While 
some of this interspouse similarity could plausibly be 
attributed to persuasion effects taking place after mate 
choice rather than to assortative mating, the levels of 
similarity are probably too high to dismiss assortative 
mating entirely. This is confirmed by a preliminary look 
at the impact of controlling for assortative mating on 
these 28 attitude items. The Virginia 30K study in- 
cludes data for parents of twins in the study, including 
parents’ individual responses to the same W-P items 
that the twins responded to. The usable sample size 
does drop substantially when we restrict our analysis 
to only twin pairs with completed W-P results for both 
parents (there are a total of 304 pairs of male/male 
or female/female twins with complete twin and parent 
W-P data, compared to approximately 4,400 pairs in 
the twin only analysis in Table 1). This effectively limits 
us to an assortative mating analysis that focuses on the 
overall index score, rather than looking at each item in 
the inventory individually. 

The approach we used is to compute the partial cor- 
relation for twin similarity in the overall index for the 28 
W-P items, controlling for (partialing out) the influence 
of the degree of parental similarity on the overall index. 
The implication for relative twin agreement is simple; 
if parental agreement results from assortative mating, 
then the resulting increase in genetic similarity will 
increase DZ twin correlations (the more alike geneti- 
cally the parents are on a trait, the more alike siblings 
will be on a trait). Controlling for parental similarity 
will therefore reduce the size of the DZ twin corre- 
lations. However, parental agreement resulting from 
assortative mating and the resulting increase in genetic 
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TABLE 3. Comparison of Australlan and U.S. Estimates of Genetic and Environmental Influences 


on Political Attitudes 
Virginia 30K Data 
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0 04 
0.22 0.16 
Mothers 
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Mean 0.16 
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computed from Martin et al. 1986, Table 1, p. 4385. 


similarity will not increase MZ twin correlations (MZ 
twins are already genetically identical, regardless of 
parental similarity or dissimilarity). Therefore, control- 
ling for parental similarity should have no effect on the 
size of the MZ twin correlations. In contrast, if parental 
agreement results from persuasion or from a shared en- 
vironment for the couple, then the impact of parental 
agreement has no genetic implications and operates 
on their offspring solely through its influence on the 
offsprings’ shared environment. This should produce 
relatively higher correlations of equal magnitude for 
both MZ and DZ twins and, therefore, lead to roughly 
comparable reductions in both the MZ and the DZ 
correlations when we partial out the effect of parental 
agreement. 

The results for a partial correlation analysis control- 
ling for parental agreement are reported in Table 2, 
on the row just below the results for the overall index. 
For MZ twins the issue of whether their parents agree 
or disagree on a particular item makes little difference 
(.65 without control versus .64 after partialing out the 
effect of parental agreement). In contrast, the correla- 
tion between DZ twins decreases modestly when the 
impact of parental agreement is removed (.43 witb- 
out control versus .37 after partialing out the effect of 
parental agreement). Further, the tendency of assor- 
tative mating to deflate estimates of heritability while 
inflating estimates of the impact of shared environment 
is clear. Without controls, the estimate of heritability 
for the overall index is .43 and the average estimate 
of the impact of shared environment is .22. When the 
impact of parental agreement is partialed out, the aver- 
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age estimate of heritability rises to .53, and the average 
estimate of the impact of shared environment drops to 
.11. Note that the traditional socialization account of 
attitude formation is not at odds with this last finding. 
If the issue positions of parents are in conflict, then we 
would hardly expect this shared conflicted setting to 
yield sibling agreement.’ 


COMPARATIVE POLITICAL GENETICS: 
EVIDENCE FROM AUSTRALIA 


Even with a data set as large as the Virginia 30K, ques- 
tions may arise over the extent to which conclusions 
are bound by time and geography. As a result, it is 
helpful to note results from a quite different context 
and a slightly different time period. ‘Table 3 presents 
a comparison of the key summary results in Table 1 
from the Virginia 30K study to comparable results in 
the Australian data described before (Truett et al. 1994; 
see also Lake et al. 2000). While the Australian study 


8 The same sort of control for parental agreement that was applied 
to the WLP inventory was applied to the party affiliation analysis Be- 
cause this is only a single item, the results are much less reliable than 
those averaged across the 28 items However, despite the fact that 
assortative mating clearly takes place with regard to party ID (only 
24 of the 543 parent pairs had opposite party affiliations), the general 
pattern of party ID being due more to shared environment than to 
heredity holds up. Using a very broad definition of disagreement 
(Le, anything short of exact agreement on a five-point scale), the 
shared environment estrmate weakens modestly but remains high, at 
almost twice the heritability estrmate in the subset of twin pairs with 


parents in some degree of disagreement on party affiliation. 
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utilized a larger set of W-P items (50 in all, compared 
to 28 in the U.S. study), the items were a mix of po- 
litical and social items, and only six items appeared in 
exactly the same form in both studies. An additional six 
items were similar enough, in our judgment, to merit 
comparison, and they are included in Table 2 with the 
Australian wording italicized. 

The broad picture from Table 3, and its comparison 
to Tables 1 and 2, is one of remarkable similarity. 
The mean heritability for the 12 item subset of the 
Virginia 30K data is 32 for the full 28 items in Table 1 
and .31 for the 12-item subset of the Australian data. 
The mean estimate for the effect of shared environment 
for the 12 item subset of the Virginia 30K data is .12 
compared to .16 for the full 28 items in Table 1 and .16 
for the 12-item subset of the Australian data. Thus the 
general pattern of a relatively greater role for hered- 
ity compared to shared environment detailed above in 
the discussion of the U.S. data in Tables 1 and 2 also 
applies to the Australian data in Table 3. While most 
of the individual items also have broadly comparable 
results in the two countries, a few, specifically “social- 
ism” and “immigration” (“nonwhite immigration” in 
the Australian study), are noticeably different. In both 
cases the U.S. pattern of substantially higher relative 
heritability is reversed in the Australian data, where we 
see evidence of relatively higher shared environmental 
effects. Whether these are meaningful reflections of 
differences in how these items relate to deeper political 
orientations is not clear, but they are in any case the 
exceptions rather than the rule.’ 


THE GENETICS OF POLITICAL IDEOLOGY 


The possibility that attitudes and behaviors are influ- 
enced by genetic variables is an emotionally charged 
topic so it is important that readers understand the 
claims being made. Partitioning the origins of human 
traits, whether they be physiological or behavioral, into 
the discrete, quantifiable components of genetic inheri- 
tance, shared environment, and unshared environment 
should not be taken to imply that these components 
work separately. Rather these numbers only provide a 
rough indication of the influence of three categories of 
independent variables that are intimately intertwined. 
(Moreover, they are estimates of the ability of inde- 
pendent variables to account for variance in the de- 
pendent variables not for the variables themselves.) 
As mentioned earlier, gene—culture interaction is the 
key to understanding the source of political attitudes 
and behaviors, just as it is the key to understanding 
most physical and behavioral aspects of the human 
condition. Genes do not work in isolation and instead 
generally influence the extent to which organisms are 


? In this case, the different results with regard to socialism could 
reflect different meanmgs of the phrase in the two countries. In 
Australa, the term socialism is closer to a party identification la- 
bel, whereas in the United States ıt has more loaded ideological 
connotations. Likewise, the addition of the qualifier “nonwhite” to 
immigration raises questions of what the key stimulus 1s 
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responsive to particular environmental conditions (see 
Boyd and Richerson 1985 and Masters 1993). 

And this conditioning influence of genetics on com- 
plex social behaviors is not the product of a single gene 
but rather numerous genes that, to make matters more 
complicated, appear to combine in configural as op- 
posed to additive ways. The same set of multiple genes 
may influence behavior in different ways depending 
on the order in which they express themselves and 
the manner in which they interact with other genes. 
Recent discoveries also suggest that biological mark- 
ers of phenotypic manifestations include the manner 
in which DNA is packed in the nucleus, particularly 
the physical location of genes relative to other genes 
and to the histones that help to give DNA its structure. 
An accurate understanding of gene expression appears 
to require knowledge not just of the sequence of nu- 
cleotides (e.g, ATCAGG) that constitutes the gene 
itself but also of the context in which each gene re- 
sides, thus forming an interesting parallel to the way 
we must try to understand the organisms (e.g., human 
beings) genes help to construct (for a good summary, 
see Kosack and Groudine 2004; also see Lykken 1999), 

Individual genes for behaviors do not exist and no 
one denies that humans have the capacity to act against 
genetic predispositions. But predictably dissimilar cor- 
relations of social and political attitudes among peo- 
ple with greater and lesser shared genotypes suggest 
that behaviors are often shaped by forces of which 
the actors themselves are not consciously aware, a 
point that is made with some force by Bargh and 
Chartrand (1999), Marcus (2002), Marcus, Neuman, 
and MacKuen (2000), McDermott (2004), and Wegner 
(2002). It is not biological determinism to posit the exis- 
tence of complex collections of genes that increase the 
probability that certain people will display heightened 
or deadened response patterns to given environmental 
cues. And it is not antibehavioralism to suggest that 
true explanations of the source of political attitudes 
and behaviors will be found when we combine our cur- 
rently detailed understanding of environmental forces 
with a recognition that genetic variables subtly but im- 
portantly condition human responses to environmental 
stimuli. 


IMPLICATIONS FOR POLITICS 


It is important to note that none of the data or argu- 
ments presented in this paper indicates that extant em- 
pirical knowledge about political socialization is use- 
less. In fact, it strongly reinforces many of the most 
salient findings in that research stream. We know from 
that research, for example, that if both parents share a 
political identification, there is a high degree of likeli- 
hood that their offspring will have that same political 
identification (Jennings and Niemi 1968; Tedin 1974). 
Our “twin study” results confirm this finding. One 
of the peculiar findings in the political socialization 
literature even makes more sense when a role for ge- 
netic inheritance in conceded. Scholars have occasion- 
ally puzzled over the fact that family arrangements and 
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styles of operation have little if any impact on the ex- 
tent to which there is a match between parental and 
offspring political attitudes on a wide variety of items 
(see Jennings and Niemi 1968, 180-83). Fathers do not 
have more influence over sons, and mothers do not 
have more influence over daughters; fathers are not 
generally more influential; the distribution of power 
within the family is irrelevant to parent-child correla- 
tions (i.e., neither highly autocratic, highly permissive, 
nor middling arrangements affect the extent to which 
attitudes are correlated); the degree to which children 
and parents feel close to each other does not matter; the 
frequency with which the family discusses politics does 
not much affect correspondence between offspring and 
parent views (though, as we would have predicted since 
it is based on active socialization, party identification is 
more sensitive to family arrangements); and the extent 
to which politics is important to the parents is also ir- 
relevant. Scholars grounded in traditional behavioral- 
ism have difficulty accounting for these “perplexing 
configurations” (Jennings and Niemi 1968, 183), but 
recognizing that the correlations between the views of 
parents and children derive more from genetics than 
familial socialization makes it much less surprising that 
the strength of these correlations is not reliant on fam- 
ily arrangements (for an example of political science 
work that does posit a role for genetics, see Peterson 
1983). 

Still, the substantive findings we present here of- 
fer a direct challenge to common assumptions and 
interpretations that political attitudes and behavioral 
tendencies are shaped primarily or even exclusively 
by environmental, especially familial, factors. Setting 
aside the important special case of party identification, 
we find that political attitudes are influenced much 
more heavily by genetics than by parental socialization. 
For the overall index of political conservatism, genet- 
ics accounts for approximately half of the variance in 
ideology, while shared environment including parental 
influence accounts for only 11%. And in the case of 
the variance in people’s tendencies to possess political 
opinions at all, regardless of their ideological direction, 
genetics explains one-third of the variance, and shared 
environment is completely inconsequential. 

What are the implications of these findings for po- 
litical science? Acknowledging a role for heritability 
in politics affects our understanding of, first, political 
issues, second, political learning, and, third, political 
cleavages. Inherited attitudes seem to be demonstrably 
different than acquired attitudes. Tesser (1993) pro- 
vides evidence that attitudes higher in heritability are 
manifested more quickly, are more resistant to change, 
and increase the likelihood that people will be attracted 
to those who share those particular attitudes. It has long 
been known that certain political issues seem “hard” 
to people, and others seem “easy,” presumably because 
some issues trigger “gut responses” while others do not 
(Carmines and Stimson 1980, 79), but no explanation 
has yet been offered for why given issues do or do 
not elicit gut responses. Why do social, more than eco- 
nomic, issues tend to hit people in the gut, even though 
both constitute ongoing and equally complex societal 
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concerns? In light of the new findings, one distinct pos- 
sibility is that easy “gut” issues tend to be those that 
are more heritable. 

To the extent that political ideologies are inherited 
and not learned, they become more difficult to manip- 
ulate. Conservative parents who try to make their chil- 
dren conservative by carefully controlling their chil- 
dren’s environments are probably overestimating the 
importance of those environments. Offspring of such 
parents are likely to end up being conservative but 
less because of the environment created by the parents 
than the genes passed along by the parents. A politi- 
cal match between parents and children should not be 
taken to be the result of a socialization process—that 
is, the active postnatal transmission of views—just as 
political mismatches between parent and child should 
not be taken as evidence against a role for genetics. 
Parent-child mismatches are distinctly possible given 
the uncertainties of meiosis (the random selection of 
just 50% of each parent’s DNA) and the possibility for 
occasional errors in the transcription and translation 
of genes (mutations). These mismatches are likely to 
be the primary cause of the fact that some children 
rebel against the views of their parents but most do 
not—a pattern that environmental factors have never 
explained satisfactorily. 

Finally, we go into somewhat greater detail to il- 
lustrate the manner in which results such as ours can 
be of use in understanding the divisions characterizing 
virtually all polities and, certainly, the United States 
in the early twenty-first century. Remember, genes in- 
fluence people’s outlooks and personalities, and it is 
these broad features that then predispose individuals 
toward suites of specific attitudes. This interpretation 
likely explains the otherwise puzzling consistency in 
ideological divisions that is present across space and 
time. The package of attitudes held, for example, by 
conservatives in the modern United States is remark- 
ably similar to that held by conservatives in other cul- 
tures and at earlier times in American history (on the 
durability of the liberal-conservative spectrum in the 
United States, see Poole and Rosenthal 1997). Envi- 
ronmental determinists have no convincing explana- 
tion for the pervasiveness of this division but genetics 
does. 

If, as our results suggest, there is a genetic basis for 
the varying political views people hold, and if, as seems 
probable, genetic transmission frequently affects clus- 
ters of political attitudes, we are likely to observe broad 
but distinct political phenotypes. The number of these 
phenotypes may vary, but for purposes of illustration 
we discuss two probable orientations. One is charac- 
terized by a relatively strong suspicion of out-groups 
(e.g, immigrants), a yearning for in-group unity and 
strong leadership, especially if there is an out-group 
threat (“Do not question the President while we are 
at war with terrorists”), a desire for clear, unbending 
moral and behavioral codes (strict constructionists), a 
fondness for swift and severe punishment for violations 
of this code (the death penalty), a fondness for system- 
atization (procedural due process), a willingness to tol- 
erate inequality (opposition to redistributive policies), 
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and an inherently pessimistic view of human nature 
(life is “nasty, brutish, and short”). 

The other phenotype is characterized by relatively 
tolerant attitudes toward out-groups, a desire to take 
a more context-dependent rather than rule-based ap- 
proach to proper behavior (substantive due process), 
an inherently optimistic view of human nature (people 
should be given the benefit of the doubt), a distaste 
for preset punishments (mitigating circumstances), a 
preference for group togetherness but not necessarily 
unity (“We can all get along even though we are quite 
different”), suspicion of hierarchy, certainty, and strong 
leadership (flip-flopping is not a character flaw), an 
aversion to inequality (e.g., support for a graduated 
income tax), and greater general empathic tendencies 
(rehabilitate, don’t punish). 

Common political usage would call the first pheno- 
type conservative and the second liberal, but we seek 
phrases that are less connected to political ideologies 
and that indicate that these two phenotypes run to the 
very orientation of people to society, leadership, knowl- 
edge, group life, and the human condition. Thus, we 
label the first “absolutist” and the second “contextual- 
ist.” This fundamental dimension offers a credible pre- 
cursor to basic cleavages manifested in a broad range of 
human social activity: politics (conservatives/liberals), 
religion (fundamentalists/secular humanists), law (pro- 
cedural/substantive due process), education (phonics/ 
whole language), art (traditional form-based real- 
ism/modern free-form impressionism), sports (foot- 
ball/frisbee), medicine (traditional AMA/wholistic), 
morality (enduring standards/situational ethics), and 
scientific inquiry (formal/empirical). In our view, all 
of these vexing perennial dichotomies are related cul- 
tural expressions of a deep-seated genetic divide in 
human behavioral predispositions and capabilities. We 
certainly are not asserting that everyone holds one of 
these two orientations. Even if the individual genes in- 
volved with absolutism or contextualism tend to move 
together, this does not mean they always do. Some 
individuals may carry, say, an absolutist’s aversion to 
out-groups but a contextualist’s rejection of a univer- 
salistic behavioral code. Moreover, genes not included 
in these central packages, perhaps those related to ex- 
troversion, ambition, and intelligence, often muddy the 
waters. 

More importantly, let us not forget that a heritable 
component of 50% for political ideology and proba- 
bly somewhat higher for the absolutist-contextualist 
dimension still leaves plenty of opportunity for the en- 
vironment to alter attitudes and behaviors—and even 
orientation. An individual with a contextualist geno- 
type who has been repeatedly victimized by out-group 
members, or who has simply spent a great deal of time 
listening to persuasive absolutists, may adopt attitudes 
that run against type. Thus, even if a political system 
started with two pure genotypes, it would soon dis- 
play a fascinating array of expressed orientations and 
beliefs, intensity levels, and degrees of involvement 
even as the system would continue to revolve around 
the central division between absolutists and context- 
ualists. 
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Such an account is speculative at this point but is 
fully consistent with the findings presented here, with 
previous research on the durability of political ide- 
ologies, and with recent events in the United States. 
Accounts of the 2004 election, for example, that do not 
invoke this fundamental difference in orientation have 
fallen flat. Issues did not determine vote choice for the 
many citizens who expressed disagreement with exist- 
ing economic policies and/or the war in Iraq yet still 
voted for the incumbent president, George W. Bush. 
Indeed, if the focus remains on issues, the resultant 
description of the American public 1s grossly at odds 
with reality. Morris Fiorina’s (2005) creative analysis 
of survey responses indicates that Americans can be 
placed in the middle on many important issues, but if 
this is true, then what explains the vitriol and intensity 
of feeling displayed by so many ordinary Americans in 
2004? 

Issues do not explain Americans’ politics Many 
Americans admit that they do not follow or under- 
stand the issues (Hibbing and Theiss-Morse 2002), and 
to the extent they do, they support whatever their 
preferred politician and party seems to support (Page 
and Jones 1979). In the 1990s, a Democratic president 
(Bill Clinton) transformed welfare to workfare; then in 
the 2000s, his Republican successor (George W. Bush) 
greatly expanded federal involvement in both educa- 
tion and the provision of prescription drugs for senior 
citizens. If the enactors of these policies were reversed, 
the groups of citizens displaying support for the policies 
also would have reversed. Similarly, if a Republican 
president had committed adultery with a young intern 
or if a Democratic president had dramatically wors- 
ened the deficit and taken the country to war in a far-off 
land on the basis of undeniably incorrect beliefs about 
the opponents’ nuclear and chemical weapons capabil- 
ities, the positions of most voters on the acceptability of 
these conditions would be completely reversed. Issue 
positions generally reflect divisions; they do not create 
them. 

Instead, the most accurate account of voting behav- 
ior in 2004 moves beyond issues to the basic, partially 
genotypic orientations described above. This sort of 
broad orientation is not far removed from what most 
commentators are trying to capture by reference to a 
“moral” division in the electorate, but without tying it 
to specific moral issues such as gay rights. The chasm in- 
spiring so much hostility between citizens of the United 
States in the early twenty-first century did not divide 
supporters and opponents of privatizing Social Secu- 
rity; it did not even divide supporters and opponents 
of gun control. Rather, as has typically been the case, 
it divided absolutists and contextualists. 

And the prospects for eliminating this divide are 
not promising. Since mate choice appears to be heav- 
ily tilted toward those with similar social and politi- 
cal attitudes, no genetic melting pot exists for these 
traits. Thus, the evidence presented here on assortative 
mating should be quite sobering to those in search of 
unity and togetherness. If anything, the heritability of 
orientation in combination with assortative mating 
may exacerbate the current divide. 


165 


Are Political Orientations Genetically Transmitted? 


May 2005 





But admitting that genetics influences political at- 
titudes could actually help to mute societal divisions. 
Currently, absolutists and contextualists simply do not 
connect, and the result is frustration. To contextualists, 
absolutists appear simplistic and selfish; to absolutists, 
contextualists appear naive and indecisive. Each side 
talks past, and is authentically miffed by, the other. Rec- 
ognizing that our political antagonists probably have a 
different genetic predisposition to people, life, human 
nature, and politics may serve to ease frustrations 
and, eventually, to improve communications across the 
chasm. If absolutists spent more time trying to think 
like contextualists and contextualists trying to think 
like absolutists, understanding would be increased and 
debates could become more constructive. As frustrat- 
ing as it may be to debate with someone who holds such 
different orientations, value exists in recognizing that 
intransigence is not the result of willful bullheadedness 
but, rather, genetically driven differences in orienta- 
tion. 

The exciting next step is to understand the reason 
guch distinct orientations have evolved and lasted. 
Evolutionary psychologists tend to assume that all 
enduring traits are adaptive (for a dissenting view, 
see Gould 2000) since natural selection drives out 
variation and makes adaptive traits ever more com- 
mon. In this organism-based interpretation, whichever 
orientation—absolutism or contextualism—is evolu- 
tionarily superior should soon come to numerically 
dominate the other. This is possible but unlikely. An 
alternative group-based interpretation sees variation 
itself as adaptive (see Alford and Hibbing 2004 and 
Sober and Wilson 1998). The benefits of genetic vari- 
ation are most easily observed in the ability of dif- 
ferential immune systems to prevent a group of or- 
ganisms from being completely wiped out by a single 
pathogen, but it is easy to imagine how sociopolitical 
variation could also create more viable groups. In fact, 
computer simulations give support to the hypothesis 
that divergent individual-level social behaviors, such as 
cooperation and defection, are beneficial at the group 
level (Hammond 2000). As loathe as contextualists and 
absolutists are to admit it, the presence of the other 
orientation may make a society stronger. 
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Ambivalence, Information, and Electoral Choice 
SCOTT J. BASINGER and HOWARD LAVINE Stony Brook University 


onventional wisdom views voter choice in House elections as preordained by party identification, 

incumbency, and perceptions of national conditions. In an analysis of voter behavior in House 

elections between 1990 and 2000, we find instead that voters are quite heterogeneous. Voters who 
hold ambivalent partisan attitudes, who typically constitute 30% of the electorate, reduce their reliance on 
party identification, this effect is entirely independent of the strength of identification. Individuals holding 
ambivalent partisan attitudes that both lack political knowledge and are presented with little campaign 
stimulus are more likely to engage in economic voting. Individuals holding ambivalent partisan attitudes 
that either are knowledgeable about politics or are presented with stimulating campaigns are more likely 
to engage in ideological voting. Thus, campaign competition and national partisan competition each play 
a role in assuring that ordinary voters may participate meaningfully in the political process. 


itizens use a variety of means to evaluate the 
political world, including party, issues, ideology, 
economic performance, and scandals. But must 
every voter attach exactly the same weight to 
each factor? Most models of electoral choice assume 
so, and even when scholars permit different voters to 
weigh the ingredients of electoral choice differently, 
they exclusively identify individuals’ abilities to ob- 
tain and process political information as the source 
of heterogeneity. Whether denoted “conceptualiza- 
tion” (Campbell et al. 1960), “sophistication” (Luskin 
1987), “awareness” (Zaller 1992), “knowledge” (Delli 
Carpini and Keeter 1996), or education (Carmines and 
Stimson 1980; Sniderman, Brody, and Tetlock 1991), 
the standard scholarly view is that the small number 
of “able” voters are more likely than the mass of ill- 
informed voters to rely on more complex (and diag- 
nostic) electoral criteria such as issues and ideology. 
If, however, voter heterogeneity also depends on the 
motivation to acquire and use available information,! 
and not just on ability to do so, then the choice pro- 
cess is more complex. The voter’s cognitive calculus 
involves a trade-off between conflicting goals: accuracy 
requires voters to make decisions that correctly reflect 
their substantive values; efficiency requires voters to 
make decisions without expending too much cognitive 
effort. The “new look” in public opinion research that 
challenged orthodox, reflexive pessimism about voter 
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is going on” (129) with the axiom that modern voters “disti 
between what they must know and what they do not need to know” 
(134) | 


incapacity (Sniderman 1993) heavily emphasized the 
latter, efficiency dimension in its assertion that a “rea- 
soning voter” might rely on shortcuts or heuristics to 
form meani appraisals of candidates (Lupia, 1994; 
Popkin 1991, Sniderman, Brody, and Tetlock 1991)2 
Unfortunately, Lau and Redlawsk (2001) demonstrate 
that neither the use of heuristics alone nor political 
sophistication alone eradicates the sizable probability 
of making incorrect decisions. 

Any description of the origins and consequences 
of voter heterogeneity requires a theory of how in- 
dividuals resolve the tension between accuracy and 
efficiency in a given choice context. A voter might 
seek more information if a campaign subsidizes the 
cost of utilizing additional information, as indicated 
by recent scholarship on Senate elections (Kahn and 
Kenney 1999; Westlye 1991) and presidential pri- 
maries (Alvarez 1998); thus situational differences may 
exist. The desire for accurate decision-making also 
could motivate individuals to acquire and process ad- 
ditional political information. Suppose that individuals 
differ in the reliability they ascribe to various shortcuts 
to electoral choice. A voter might seek more informa- 
tion if the reliability of a shortcut is diminished, and 
the converse is also true: voters who find one shortcut 
to be especially reliable might choose to ignore readily 
available and diagnostic information, such as candidate 
ideology, if they do not perceive it as valuable in raising 
their confidence in a decision. 

In our analysis of voter heterogeneity in House elec- 
tions, we ask: What factors determine the conditions 
under which voters will relinquish reliance on partisan 
cues? And, when they do, what factors determine the 
considerations these voters turn to instead? By iden- 
tifying first, the conditions in which voters are capa- 
ble of reasoning ideologically but choose not to do 
so because partisan cues provide sufficient confidence 
and, second, the conditions in which campaigns help 
elevate voter choice, certain fears about the failure of 
American democracy are assuaged. Voters do not 


2 Electoral choice might be simplified by voter reliance on the 


parties’ ideologies (Downs 1957; Hinich and Munger 1994) or on 
evaluative “running tallies” of performance such as party identifica- 
tion (Fiorina 1981; Rahn 1993) and presidential approval (Enkson, 
MacKuen, and Stimson 2001). 
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attach the same weight to each ingredient of electoral 
choice, but the causes of heterogeneity are driven more 
by the quality of party cues and the quality of cam- 
paigns than by the intellectual qualities of voters. 


INFORMATION AND THE CONGRESSIONAL 
VOTE CHOICE 


When American voters enter the polling booth, they typ- 
ically do not carry much ideological or even issue-specific 
baggage.. . Asa general rule, voters reach their decisions 
on the basis of three considerations: party loyalties, candi- 
date assessments, and overall judgments about the state of 
the nation. (Davidson and Oleszek 2001, 104) 


Deciding whom to vote for in House elections is sup- 
posed to be easy; the three factors mentioned in the 
quote above correctly predict 92% of voters’ deci- 
sions in House elections (Jacobson 2001, 150), and 
none of the three requires citizens to be prospec- 
tive. Stokes and Miller’s (1962) seminal study of the 
1958 midterm election is responsible for the estab- 
lishment of a general tone of pessimism about con- 
gressional elections communicating a mandate, and for 
two durable findings: partisanship is the dominant fac- 
tor in vote choice in congressional elections, and only 
candidates’ “salience” moderates party loyalty. As the 
value of party labels as a guide to voting behavior de- 
clined through the 1970s (Cover 1977; Nie, Verba, and 
Petrocik 1976), scholars’ emphasis logically shifted 
from partisan cues to candidate cues, but painting 
a rosy picture of voter behavior in House elections 
was hardly a result. Findings based on the celebrated 
1978 National Election Study still comprise much of 
the textbook account of congressional voter behavior 
(cf. Jacobson 2001, chap 5), especially the finding 
that voters’ defections overwhelmingly favored incum- 
bents, who typically achieve superior levels of famil- 
jarity, personal favorability, and voter contact (Mann 
1978; Mann and Wolfinger 1980). Presidential approval 
or disapproval also influenced the 1978 House elec- 
tions, although further research has suggested that the 
effects of national forces are variable across electoral 
contexts. In the 1974 and 1982 midterm elections, both 
of which took place during Republican administrations, 
presidential approval had greater impact than in 1978, 
but it only affected voter choices in seats defended by 
Republican incumbents (Abramowitz 1985). Gronke, 
Koch, and Wilson (2003) also indicate that presidential 
approval’s impact on congressional candidate evalua- 
tions is conditional on the incumbent’s support for the 
president’s policies. 

If one wishes to draw an optimistic inference from 
this scholarship, one might assert that voters are re- 
sponsive to the political context, relying on different 
criteria as the situation permits. This inference is re- 
inforced by scholars’ acknowledgment that Senate in- 
cumbents do not share as richly in their House counter- 
parts’ advantages (cf. Gronke 2000 and Krasno 1994). 


3 The year of Stokes and Miller’s survey, 1958, was the peak of party 


loyalty, with 84% of partisans voting the party line; 20 years later 
that rate had fallen to 69% (Mann and Wolfinger 1980, 620). 
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The study of Senate elections has placed special em- 
phasis on how campaign intensity affects the voter’s 
informational environment, articulated by Westlye’s 
(1991, 109) contention: “In a low key race, the average 
voter will have a hard time evaluating the candidates’ 
issue positions or ideological postures. On the other 
hand, in a hard-fought campaign, sufficient informa- 
tion may be generated for many voters to be able to 
choose a candidate on the basis of ideology or indi- 
vidual issues instead of falling back on party identi- 
fication or incumbency as decision criteria.” Gronke 
(2000, 148) echoes this assessment: “In low inten- 
sity elections, voters look to their partisanship, im- 
pressions of the nature of the times, and the incum- 
bent’s reputation for helpfulness. ... In a high intensity 
election... partisanship, presidential approval, and in- 
cumbent helpfulness decline in impact.” What takes the 
place of these factors? Kahn and Kenney (1999) assert 
that “individuals adjust their decision rules depending 
on the closeness of contests. As campaigns become 
more competitive, voters respond by relying more 
heavily on sophisticated criteria, such as ideology and 
issues, when evaluating the opposing candidates... In 
high intensity campaigns, voters have an incentive to 
seek and to reflect on detailed information available 
about candidates. Therefore, voters will be less likely 
to rely on cognitive shortcuts” (6, 26). 

Campaigns might affect voter heterogeneity by 
means other than their intensity. Distinctiveness of can- 
didates’ ideological positions in presidential elections 
(Nie, Verba, and Petrocik 1976; Page 1978) and Senate 
elections (Abramowitz 1981) has been shown to affect 
the relative frequency of issue voting. Franklin (1991) 
found that the clarity of voters’ perceptions increased 
when incumbents stressed issues in Senate campaigns 
and that the incumbent’s voting behavior impacted vot- 
ers’ perceptions of the incumbent’s issue positions. This 
scholarship thus paints an optimistic—but condition- 
ally so—picture: the average voter’s default might be to 
rely on the heuristic or unsophisticated criteria of party 
identification, incumbency, and presidential approval, 
but campaigns that are sufficiently intense or issue- 
oriented might boost reliance on more sophisticated 
criteria. 

In short, previous research into heterogeneity in con- 
gressional vote choice has concentrated on the quan- 
tity of information provided by the campaigns (e.g., 
intensity) and the quality of information about the can- 
didates (e.g., ideological clarity). Kahn and Kenney’s 
analyses nonetheless indicate that hard-fought cam- 
paigns cannot overcome certain individual differences: 
better-informed voters rely more heavily on ideologi- 
cal postures and issue positions than do less informed 
voters across all levels of campaign intensity. We do not 
dispute the informational element of heterogeneity in 
congressional evaluations; indeed voters cannot rely 
on ideological or policy considerations without at least 
a modicum of knowledge about where the candidates 
stand. But Lupia and McCubbins (1998, 2) have argued 
that under most circumstances, even sophisticated citi- 
zens will “disregard most of the information they could 
acquire, and base virtually all of their decisions on 
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remarkably little information.” That is to say, although 
a voter has the ability to vote ideologically, we must 
first identify conditions under which a voter has the 
motivation to use prospective information in making a 
political decision. 


PARTISAN AMBIVALENCE AND THE 
CONGRESSIONAL VOTE CHOICE 


Our model of voter choice builds on two established 
axioms of political judgment. The first axiom is the 
least effort principle, which states that individuals are 
economy-minded, in that they wish to avoid expend- 
ing unnecessary amounts of cognitive energy in reach- 
ing decisions (Fiske and Taylor 1991; Simon 1976). 
This principle provides a compelling explanation for 
individuals’ reliance on informational shortcuts and 
heuristics across a variety of political contexts (Lau 
and Redlawsk 2001; Lupia 1994; Sniderman, Brody, 
and Tetlock 1991). The second axiom is the sufficiency 
principle, which states that individuals are accuracy- or 
validity-minded, in that they desire a certain measure of 
judgmental confidence in their attitudes and decisions 
(Petty and Cacioppo 1986). Together, these principles 
imply that voters will “exert whatever effort is required 
to attain a sufficient degree of confidence” that their 
electoral choices accurately reflect their interests and 
values, but that they prefer “less effortful to more ef- 
fortful modes of information processing” (Eagly and 
Chaiken 1993, 330). Applied to the electoral context, 
this implies that we must evaluate the determinants of 
voters’ confidence in partisan cues, which will deter- 
mine the conditions under which voters are motivated 
to recruit other considerations. 

One possibility is that voters with strong partisan 
identifications achieve sufficient confidence in their 
electoral decisions on the basis of partisan cues alone, 
while voters with weak attachments to a political party 
will seek more information before making vote choices. 
If this is the case, weak partisans and independents 
might be expected to rely more heavily on specific is- 
sues, ideology, or economic performance than strong 
partisans. Another possibility is that it is not the 
strength of identification per se but, rather, the structure 
of a voter’s reactions to party labels that determines 
the contribution of partisan cues to the voter’s elec- 
toral confidence. Research indicates that citizens often 
embrace central elements of both sides of political de- 
bates, expressing both individualist and egalitarian be- 
liefs on the question of welfare spending (Feldman and 
Zaller 1992) and valuing both feminism and religion 
on debates about abortion (Alvarez and Brehm 2002). 
Explanations for this phenomenon vary—few citizens 
possess the political wherewithal to resist arguments 
counter to their values and interests (Zaller 1992), 
policy disputes activate widely shared but incommen- 
surable values (Alvarez and Brehm-2002), and elec- 
toral contests often provide ample amounts of positive 


* See Rivers 1988 for an analysis of heterogenerty in evaluating presi- 
dential candidates that proceeds ın this fashion 


and negative information about each of the compet- 
ing campaigns—but none refute the implication that 
political opinions are infused with conflicting beliefs 
and feelings, in other words, that political opinions are 
frequently ambivalent. 

To date, studies of ambivalence have focused on 
policy issues or on candidates (e.g., Lavine 2001 and 
McGraw, Hasecke, and Conger 2003), but we contend 
that political parties are likely to be especially fertile 
targets of ambivalent opinion. Party labels are strongly 
linked to issue, value, and ideological conflict, as well 
as to support for distinct social groups (eg., Miller, 
Wlezien, and Hildreth 1991). Embracing elements of 
both parties’ policy positions, holding both union mem- 
bership and an evangelical religious orientation, or 
simply admiring salient figures within each party is 
likely to generate attitudinal inconsistency within or 
across these domains, which may translate directly into 
ambivalent partisan attitudes. Most important for our 
purposes, research suggests that when attitudes are am- 
bivalent, they should provide less reliable decision cues. 
Ambivalent opinions are held with less confidence and 
less clarity, and are more difficult to retrieve from 
memory, than relatively one-sided political opinions 
(Alvarez and Brehm 1995; Bargh et al. 1992; Huckfeldt 
and Sprague 2000; McGraw, Hasecke, and Conger 
2003). Consequently they are less stable and exert 
less impact on political choice than univalent opinions 
(Lavine 2001; Zaller and Feldman 1992). By inference, 
voters with ambivalent partisan attitudes should derive 
less judgmental confidence from party cues than voters 
with univalent (i.e., nonambivalent) partisan attitudes 
and must therefore rely on alternative considerations 
to achieve sufficient confidence in their electoral deci- 
sions. l 
As we noted earlier, partisanship is the preeminent 
electoral cue in the House election context. Party iden- 
tification should convey sufficient electoral confidence 
to voters to whom partisan cues are especially reliable, 
and further information processing should have negli- 
gible benefit. Thus when partisan ambivalence is low, 
partisan cues should dominate the decision-making of 
both informed and uninformed voters. Among voters 
holding ambivalent partisan attitudes, however, parti- 
san cues will leave a substantial gap between voters’ 
actual and their desired levels of judgmental confi- 
dence; such voters must recruit other considerations 
to reach their sufficiency thresholds. Partisan ambiva- 
lent voters who are well informed should endeavor 
to evaluate the candidates on the most diagnostic 
(but most costly) criteria, approximating ideological or 
issue voting. Partisan ambivalent voters who are 
poorly informed, and are by all accounts “innocent of 


> The authors of The American Voter recognized that an individual 


voter’s “system of partisan attitudes” could be consistently favorable 
toward one party or that the elements of the system could be in con- 
flict. They speculated that conflicted partisan attitudes would delay 
the formation of presidential voting intention, increase split-ticket 
voting, and reduce concern about the election outcome Echoing 
our argument about the prevalence of party ambivalence, Campbell 
et al (1960) found that fewer than 50% of voters favored one party 
over the other by a margin of two or more attitudes. 
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ideology,” should increase their reliance on simpler 
(1.e., less costly) criteria, especially judgments about 
economic performance. 

Intense congressional campaigns alter voter calcu- 
lus by subsidizing the informational tasks confronting 
voters, as Gronke (2000), Kahn and Kenney (1999), 
and Westlye (1991) have argued. Intense campaigns 
might also heighten voters’ sufficiency thresholds, 
since in hard-fought, openly ideological campaigns, 
voters—especially those who are highly informed to 
begin with—might desire greater levels of judgmental 
confidence in their electoral choices. For either reason, 
intense campaigns should increase voters’ reliance on 
higher cost and higher value criteria, especially among 
voters for whom party identification provides less elec- 
toral confidence (those with ambivalent partisan atti- 
tudes). Intense campaigns should also reduce voters’ 
reliance on low value criteria such as economic per- 
formance. We will test the following four hypotheses 
about voter behavior in congressional elections. 


1. Voters will rely less heavily on party identification 
and presidential approval as their level of partisan 


ambivalence rises. 

2. Highly informed voters with ambivalent partisan at- 
titudes will rely more on ideology and/or issue po- 
sitions than voters with univalent partisan attitudes 
or poorly informed voters with ambivalent partisan 
attitudes. 

3. Poorly informed voters with ambivalent partisan at- 
titudes will rely more on economic perceptions than 
voters with univalent partisan attitudes or highly 
informed voters with ambivalent partisan attitudes. 

4, Intense campaigns will increase voters’ reliance on 
ideology and/or issue positions at all levels of knowl- 
edge, will tend to reduce voters’ reliance on eco- 
nomic perceptions at all levels of knowledge, and will 
therefore reduce the stratifying effects of political 
knowledge. 


In short, we should observe voters with univalent parti- 
san attitudes to be the heaviest dependents on partisan 
cues. We should observe two types of voters engaging 
in ideological voting: those who are ambivalent toward 
the parties and are politically knowledgeable, and those 
who are ambivalent toward the parties and are pre- 
sented with a highly stimulating campaign. We should 
observe voters engaging in economic voting when three 
conditions are met: they are ambivalent toward the 
parties and they are poorly informed and the campaign 
provides little stimulus. That is, campaigns may act as 
a substitute for voters’ knowledge about politics, over- 
coming individual differences and raising the caliber of 
electoral choice by less knowledgeable voters. 


THE MEASURE OF PARTISAN 
AMBIVALENCE 


Before we can turn to empirical analysis, we must 
bear two burdens of proof, demonstrating both that 
ambivalence toward parties occurs with sufficient fre- 
quency that it is worthy of serious study and that am- 
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bivalence toward parties can be operationalized ad- 
equately. As our concern is with positive and neg- 
ative evaluations of the political parties, we employ 
the NES open-ended likes/dislikes question about the 
political parties, which ask: “I’d like to ask you what 
you think are the good and bad points about the two 
national parties. Is there anything in particular that 
you [like/dislike] about the [Democratic/Republican] 
Party?” Four follow-up probes are offered (“Anything 
else?”), thus respondents are invited to provide up 
to five likes and five dislikes for each of the two 
major parties. Wattenberg (1981, 1990) has utilized 
these questions to demonstrate that between 1952 and 
1980, voters’ evaluations of the political parties have 
become more neutral (and not necessarily more nega- 
tive, as Nie, Verba, and Petrocik [1976] had suggested). 

Behavioral conflict theorists (e.g., Mowrer 1960) and 
contemporary attitude theorists (e.g., Hass et al. 1991 
and Thompson, Zanna, and Griffin 1995) have iden- 
tified two necessary and sufficient conditions for the 
arousal of ambivalence. First, positive and negative 
reactions to the attitude object should be similar in 
magnitude; if one component becomes stronger than 
the other, attitudes become polarized and ambivalence 
is decreased. Second, positive and negative reactions 
should be of at least moderate intensity, so that am- 
bivalence is greater when voters have many reactions 
than when they have few reactions. To integrate these 
positive and negative reactions into a numerical index, 
the standard practice in political science® is to employ a 
measure introduced by Thompson, Zanna, and Griffin 
(1995) and used by Lavine (2001): 


P+N 
Ambivalence; = a —]P- N], 


where P and N denote the number of positive and 
negative reactions to an object (e.g., a political party), 
respectively. Intensity is captured on the left by the 
average number of positive and negative reactions; 
similarity is captured on the right by the absolute value 
of the difference between the number of positive and 
the number of negative reactions. 

Our view of partisan ambivalence demands a focus 
not on attitudes toward an individual party but, rather, 
on the extent to which respondents’ reactions consis- 
tently favor one party over the other (e.g., strongly 
liking the Democrats and strongly disliking the Re- 
publicans), or instead are inconsistent, favoring one 
party in some ways and the other party in other ways. 
To construct a comparative index that involves both 
parties, let Pp denote the number of positive reactions 
to the Democratic party, let Pr denote the number 
of positive reactions to the Republican party, and let 
Np and Nr denote the number of negative reactions 
to the relevant party. Define D as the average of the 
positive reactions to the Democrats and the negative 
reactions to the Republicans (i.e., D = (Pp + Nr)/2), 
and define R as the average of the positive reactions 


6 See, og. Lavine 2001; McGraw, Hasecke, and Conger 2003; 
Meffert, Guge, and Lodge 2004, and Steenbergen and Brewer 2004. 
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Prevalence of Partisan Ambivalence, by Year 


and Partisan Identification (Voters Only) 


Disaggregated by Party Identification 


All Voters Democrat 


29.6 26.8 
44.2 52.0 


26.2 
362 


29.1 

45.3 
25.6 
1,358 


26.9 
46.2 
26.9 
925 


16.8 
21.8 
61.4 
1,024 


Ambrvalent 38.3 
Unlvalent 45.5 
Indifferent 18.2 

N 910 


to the Republicans and the negative reactions to the 
Democrats (i.e., R = (Pr + Np)/2). We compare the 
overall intensity of the respondent’s affect toward both 
parties, minus the overall similarity of the respondent’s 
reactions toward the parties, in the following way: 


Ambivalencezgmp = 2ta — |D- R]. 


Pp, Pr, Np, and Np range from 0 to 5; therefore com- 
parative partisan ambivalencescores range from a high 
of +5.0 when reactions to both parties are highly in- 
tense and ambivalent (Pp= Nr= Np= Pr= 5), to 
a low of -2.5 when reactions to the parties are highly 
polarized (when one party is strongly liked and the 
other strongly disliked; e.g., PR= Np= 5, and Nr= 
Pp= 0). 

Table 1 displays the prevalence of partisan ambiva- 
lence using our “comparative” ambivalence measure, 
utilizing data from all National Election Studies con- 
ducted between 1990 and 2000, except 1998 (the party 
likes/dislikes question that we use to code partisan am- 
bivalence was not asked that year).® We classify respon- 
dents as ambivalent if their Ambivalencegamp scores 


7 Intermediate ambrvalence scores result when reactions are less 
intense, 

8 The NES half-sampled the party hkes/dislikes questions in 1990. 
Thus, our analysis throughout reflects only those respondents who 
were asked this battery of open-ended questions 





Independent Repubilcan 


30.0 33.4 
20.0 36.2 
50.0 30.4 

138 


34.3 
38.6 


are positive (indicating a mix of positive and negative 
reactions toward the parties); we classify respondents 
as univalent if their scores are negative (indicating pre- 
dominantly positive or negative reactions toward the 
parties); we classify respondents as indifferent if they 
provide no open-ended responses toward the parties. 
Table 1 indicates that levels of comparative ambiva- 
lence were remarkably high. Well over one-third of 
voters held ambivalent partisan attitudes in 2000, and 
nearly 30% of voters held ambivalent partisan atti- 
tudes in 1990, 1992, and 1994.’ Note that Republicans 
were significantly more ambivalent than Democrats in 
1992 and 2000, while Democrats were more ambiva- 
lent in 1994. Independents are distinctively indifferent 
to the political parties, with 50% or more expressing 
indifferent partisan attitudes in 1990, 1994, and 1996. 
This comparative ambivalence measure, converted to 
a unit scale, is our Partisan Ambivalence variable in the 
empirical work that follows. 


? Respondents with ambrvalent partisan attitudes generally provided 
several likes and/or dislikes toward the parties. For 83% of ambiva- 
lent respondents (averaged across elections), both D and R were 
based on more than one response. Averaged across the five election 
years, the mean sum of ambrvalent respondents’ partisan likes and 
dishkes was 7.38, with an SD of 3.57 and interquartile range (25th 
to 75th percentiles) of 4 8 to 9.8. The average responses among vot- 
ers coded as ambivalent are as follows: 1.78 likes and 1 91 dislikes 
toward the Democrats and 1.71 likes and 1.97 dislikes toward the 


Republicans 
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OTHER VARIABLES 


Let us now turn to our coding of other variables used 
in our empirical analysis of congressional voting. Two 
variables are specific to the voter. Party Identification 
is coded as a trichotomous variable from the seven- 
point scale, taking the value of 1 if the voter self- 
identified as a Republican (Strong, Weak, and Lean), 
0 if the voter is an Independent, and —1 if the voter 
is a Democrat (Strong, Weak, and Lean). Ideology 
is coded as a seven-point measure of ideological self- 
identification, in which 1 denotes the liberal extreme 
and 7 the conservative extreme of the continuum.” 
Four variables measure candidate characteristics. In- 
cumbency is a trichotomous variable, taking the value 
of 1 if the voter’s House seat is occupied by a Repub- 
lican incumbent, 0 if it is an open seat, and —1 if it 
is occupied by a Democratic incumbent. The voter’s 
Familiarity with each party’s candidate is coded 1 if the 
voter recalled the candidate’s name, 0.5 if the voter 
recognized the candidate’s name only, and 0 otherwise. 
The voter’s Favorability denotes the Republican can- 
didate’s net approval advantage over the Democratic 
candidate, coded from the open-ended likes/dislikes 
probes about the candidates, in which respondents are 
asked, “Is there is anything in particular about [candi- 
date] that might make you want to vote [for; against] 
him?” Four follow-up probes are offered again, hence 
respondents are invited to provide up to five likes and 
five dislikes for each of the two major party candidates. 
We create a single comparative favorability scale us- 
ing the formula (Pr + Np) — (Pp + Nr); higher scores 
indicate a more favorable evaluation of the Republi- 
can candidate. Two additional variables are intended to 
capture national political forces. Presidential Approval 
is assessed on a four-point scale where 1 = approve 
strongly, 2 = approve not strongly, 3 = disapprove not 
strongly; and 4=disapprove strongly. Business Con- 
ditions is based on subjects’ perceptions that during 
the past year, the economy as a whole has 1 = gotten 
better, 3=stayed the same, or 5= gotten worse. The 
scales for both variables are converted to a zero-to-one 
range, and the values from 1990 and 1992 (during a Re- 
publican presidency) are reversed so that high scores 
uniformly indicate more likely support for Republi- 
can candidates across elections. Because of our coding 
strategy, seven of these eight predictors of the vote 
choice are expected to have positive coefficients—the 
exception is Democrat familiarity—meaning that vot- 
ers are more likely to vote for the Republican candidate 
if they identify themselves as Republican, if they iden- 


10 Ideally we would prefer to employ the voter’s ideological prox- 
ımıty to the candidates rather than the voter’s raw ideology. Un- 
fortunately, voter placement of candidates 1s widely known to be 
plagued by projection effects, making it unreliable and leading many 
scholars instead to employ the average placements of the candidates 
across the pool of voters. Since we are using House districts, however, 
insuffiaent responses make ıt impossible to create averages with any 
degres of confidence Also, we initially included the respondent's 
attitude toward the single policy item that shows up in all five elec- 
tions in our sample, the government spendi ices question This 
variable had no direct effects and failed to interact with any of the 
moderators and, therefore, was dropped without any consequence 
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tify themselves as conservative rather than liberal, if 
their net affective evaluation of the candidates favors 
the Republican candidate, and so on. 

To isolate the unique effects of partisan ambiva- 
lence, we must control for the effects of two alter- 
native sources of heterogeneity among voters. First, 
politically knowledgeable citizens are better able to 
integrate their political beliefs into a coherent struc- 
ture (e.g., Delli Carpini and Keeter 1996, Luskin 1987, 
Sniderman, Brody, and Tetlock 1991, and Zaller 1992), 
therefore we must separate any effects of ambivalence 
from those stemming from levels of political knowl- 
edge. The variable Knowledge is coded from a test of 
objective political knowledge.'! Second, to assure the 
effects of partisan ambivalence are independent of the 
strength of voters’ partisan attachments, we include 
Partisan Extremity, coded 1 for voters self-identified 
as strong partisans, .67 for moderate partisans, .33 for 
weak partisans, and 0 for independents. The (pooled) 
correlations of Partisan Ambivalence with these alter- 
native sources of heterogeneity are r=.10 for Knowl- 
edge and r= — .14 for Partisan Extremity. 

The two-way interactions suggested by Hypothe- 
sis 1 pertain to the role of partisan ambivalence in 
moderating voters’ reliance on partisan cues in House 
elections. As ambivalence toward the political par- 
ties increases, the influence of party identification 
should decline; given that Party Identification is ex- 
pected to have a positive coefficient, we should ob- 
serve negatively signed interactions between Partisan 
Ambivalence and Party Identification, and perhaps 
between Partisan Ambivalence and Presidential Ap- 
proval as well. The three-way interactions suggested by 
Hypotheses 2 and 3 pertain to the joint influence of 
political knowledge and partisan ambivalence on vot- 
ers’ reliance on ideology and the economy. When a 
voter’s partisan attitudes are characterized by ambiva- 
lence, the voter is motivated to increase reliance on 
nonpartisan considerations in order to achieve suffi- 
cient electoral confidence. Hypothesis 2 states that in- 
formed voters will increase their reliance on ideology, 
but only when partisan ambivalence is high, implying a 
positively signed three-way interaction of Partisan Am- 
bivalence x Knowledge x Ideology. Hypothesis 3 states 
that uninformed voters with ambivalent partisan atti- 
tudes will increase their reliance on alternative easy-to- 
use criteria, such as economic performance; this implies 
a negatively signed three-way interaction of Partisan 
Ambivalence x Knowledge x Business Conditions. We 
postpone discussion of Hypothesis 4, which pertains to 
campaign intensity. 


11 Knowledge was assessed in each election year with items asking 
which party held the most seats in the House and the Senate prior to 
the election and with tems assessing recognition of and knowledge 
about political figures (e.g , Al Gore, Tom Daschle, Newt Gingrich). 
12 Some scholars (e.g., Smith 1980, 1989, and Zaller 1992) have mter- 
preted the absolute number of open-ended responses as a reflection 
of political sophistication. An earher version of the article controlled 
for the voter’s “loquacity” (i.e , the sum of voters’ open-ended reac- 
tions to the parties) but consistently found it to be barely correlated 
with our ambivalence measure and to have zero effect empirically 
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TABLE 2. Logi Estimates of Voter Choice In House Elections, 1990—96 and 2000: 
Dependent Varlabte Is Vote for Republican House Candidate 


Model] 1: 
*Thin"# 
Direct effects 
Party Identification .977°"* 
(.081) 
Presidential Approval .959"** 
(.197) 
Business Conditions .176* 
(.102) 
Ideology 1.885*** 
(.328) 
Incumbency .909*** 
(.074) 
Democrat Famillartty —.484*"* 
( 175) 
Republican Famillartty 1 0627" 
(.178) 
Favorabllity 20.350*** 
(1.038) 
Partisan Ambivalence — 452 
(.495) 
Political Knowledge .817"** 
(.255) 
Two-way Interactions 
Partisan Ambivalence —1.119** 
* Party Identification (.571) 
Partisan Ambivalence —.865 
* Presidential Approval (1.280) 
Partisan Ambivalence 1.210° 
* Business Conditions (.721) 
Partsan Ambivalence 1.046 
* Ideology (2.443) 
Politica! Knowledge .163 
* Business Conditions (.338) 
Polltical Knowledge 1.845* 
* Ideology (1.018) 
Partisan Ambivalence —.706 
* Political Knowledge (1.579) 
Three-way Interactions 
Partsan Ambivalence —6.685*"* 
* Poltica! Knowledge (2.323) 
* Business Conditions 
Partisan Ambivalence 25.752°** 
* Polltical Knowledge (7.383) 
* Ideology 
.146 
Constant (.145) 
N 3552 
Pseudo-A* 626 


Model 1A* 
Model 2: a o e O L 
“Saturated"*° “Not Intense” “Intense” 
Sor" .963*"" 1.118"** 
(.084) (.091) (.193) 
861"*** 1.045*** .889* 
(208) (222) (.470) 
186* .201" 106 
(.105) (113) (257) 
1.739*** 1.617*** 2.899*** 
(.336) (.368) (.795) 
925*** 1.040*** .409** 
(.081) (.083) (.192) 
— g —.319" —.985"*" 
(.193) (.478) 
1.060*** .952*** 1.409*** 
( (.196) (.481) 
21.330**" 18.878*** 25.865*** 
(1.269) (1.125) (2.870) 
—.243 —.520 — 365 
(557) (543) (1.316) 
837*** .581** 1.790*** 
(.267) (.284) (.655) 
—,969* —1.134" —.939 
(.585) (642) (1.612) 
—1.112 —.965 —, 
(1.327) (1.402) (3.775) 
1.357* 1.591** —.091 
(.751) (801) (1.858) 
677 .172 8.361 
(2.568) (2.681) (6.979) 
016 .134 .158 
(.351) (.373) (.884) 
1.879" .703 
(1.191) (1.129) (2.541) 
— 539 —3.371 
(1.785) (1.767) (4.523) 
—7.107*** —8.230"** 2.250 
(2.419) (2 564) (6.983) 
2/7.991*** 26.625**"* 22.210 
(7.959) (7.957) (21.733) 
155 134 121 
(.152) (.170) (.326) 
3552 2817 735 
639 620 .683 


Note *** p < .01 (two-talled), ** p < .05 (two-tailed), * p < .05 (one-talled). (Standard errors In Parentheses.) 


“Model also contains dummy vanables for year, not displayed 


> Model also contains 24 additonal two- and three-way interacbons; see text and footnote 17. 


EMPIRICAL ANALYSIS 


Table 2 contains a logit analysis of the vote decision, uti- 
lizing pooled data from the five elections (1990-2000, 
except 1998). The dependent variable in this analysis 


D We present pooled data from five surveys rather than the individ- 
ual year analyses because of statistical challenges when attempting to 


find “moderator” or interactive effects in fleld studies (as opposed to 
in an experimental environment, m which distributions of treatment 
variables either are randomly assigned by the investigator or occur 
naturally in a roughly normal distnbution). Deviations from optimal 
distributions can have dramatically negative impact on the effec- 
tiveness of statistical tests McClelland and Judd (1993) show that 
attaining adequate statistical power to detect significant interaction 
effects m field studies with badly skewed distributions might require 
100 times the observations that would be needed to show the same 
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is whether the respondent voted for the Republican in 
the House election, taken from the postelection wave 
of interviews. Approximately half of respondents re- 
ported voting in each election, and due to item non- 
response on the independent variables, our total num- 
ber of respondents is 3,552. Columns 2 and 3 provide 
two different specifications of the empirical model for 
all congressional districts: Column 2 contains results 
from the most parsimonious model possible for testing 
Hypotheses 1 through 3; Column 3 contains re- 
sults from an analysis in which all three potential 
heterogeneity-inducing variables (Partisan Ambiva- 
lence, Knowledge, and Extremity) are included both 
as first-order terms and as interactions with each of 
the eight standard predictors of vote choice in House 
elections!4 The results depicted in both Column 2 
and Column 3 show that most of the standard predic- 
tors of House vote choice have statistically significant 
coefficients with the expected signs (ie., all positive 
except for Democrat Familiarity), with the modera- 
tor variables held at their mean values.» The voter’s 
Party Identification, Presidential Approval, and Ideol- 
ogy each directly affect the vote choice in House elec- 
tions. Incumbency, the candidates’ relative Familiarity 
with and Favorability, and Business Conditions also 
each directly affect the vote choice." 

Regarding Hypothesis 1, the primary question of in- 
terest is whether the effects of the partisan cues vary 
across levels of the voter’s ambivalence toward the par- 
ties. Whether looking at the thin model in Column 2 
or the saturated model that controls for other poten- 
tial sources of heterogeneity in Column 3, the Part- 
san Ambivalence x Party Identification interaction is 
negative and statistically significant. The Partisan Am- 
bivalence x Presidential Approval interaction is also 





effects in an expermental setting and provide the following guid- 
ance. “What, then, can field researchers do to improve therr chances 


of detecting interactions or quadratic effects? Obvious methods for 
increasing statistical power are to accept higher rates of Type I errors 
(a strategy not usually acceptable to journal editors) or to increase 
the number of observations. Typical field study joint distributions 
of X and Z are so mefficient that enormous samples are required 
to have the statistical power of optimally designed experiments for 
detecting interactions” (384) Naturally, these problems are com- 
pounded when one is looking for three-way or four-way interactions 
4 The continuous Ambrvalence and Knowledge variables and the 
four-point Extremuty variable were mean-centered; coefficients of 
the first-order predictors are conditional at these variables’ mean 
values. To test the alternative hypothesis stated on page 171, we 
interact Extrernity of party identification with Ideology, Party Ident- 
fication, Business Conditions, Incumbency, and so on. The saturated 
model also includes four additional three-way interactions of Am- 
bivalence x Information with the remaining direct predictors of vote 
choice Results are available on request. 
15 The large number of collinear mteraction terms in Model 2 might 
have caused estimation problems, however, this did not occur. Mul- 
ticollineanty, as indicated by large standard errors and unstable or 
implausible coefficients, 1s not in evidence in Model 2 Standard 
errors and the coefficients in Column 3 are very similar to the “thin” 
model in Column 2, and coeffiaents uniformly take the expected 


signs 
16 Note that we ıncluded dummy variables for each election year, 


1994 ts our excluded case. The results mdicate only that Republican 
candidates garnered more votes m 1994 than in 1992 or 2000 
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negative but is not statistically significant. So when vot- 
ers have ambivalent reactions to the political parties, 
they do reduce their reliance on partisan cues—at least 
the primary cue of party identification—but to what 
criteria do they turn? 

The results in Columns 2 and 3 indicate that reliance 
on ideology depends jointly on high levels of both par- 
tisan ambivalence and political knowledge. The posi- 
tive and statistically significant three-way interaction 
of Partisan Ambivalence x Knowledge x Ideology in- 
dicates that voters increase their reliance on ideol- 
ogy only when they are both knowledgeable and am- 
bivalent about the political parties. As for economic 
voting, the two-way interaction of Partisan Ambiva- 
lence x Business Conditions is positive and statistically 
significant, but this two-way interaction is qualified by 
the expected negatively signed, statistically significant, 
and substantively large three-way interaction of Parti- 
san Ambivalence x Knowledge x Business Conditions. 
Again, it is the joint occurrence of ambivalent parti- 
san attitudes and low political knowledge that drives 
economic voting in House elections.” 

The coefficients reported in Table 2 support our hy- 
potheses about the joint role of partisan ambivalence 
and political knowledge on voter heterogeneity, but we 
recognize that logit coefficients are difficult to interpret 
directly, particularly in empirical models with so many 
interactive terms. To provide real insight into each vari- 
able’s substantive impact on House vote choice, we 
calculate the predicted probabilities of different pro- 
totypical individuals supporting the Republican can- 
didate, based on the characteristics of representative 
voters and a representative pair of candidates. Figure 1 
depicts the moderating impact of partisan ambivalence 
on Party Identification. We distinguish voters at three 
different levels of Party Identification; the figure repre- 
sents voters at low ambivalence and high ambivalence 
(at one standard deviation below and one standard 
deviation above the mean, respectively).18 In general, 
the steeper the slope of the line, the greater the vari- 
able’s ceteris paribus impact on vote choice. Partisan 
ambivalence-induced heterogeneity is indicated by a 
difference in the slopes of the two lines, with the shal- 
lower slope for high-ambivalence voters illustrating 
that partisan ambivalence weakens the voter’s reliance 


17 Column 3 does not display the results of 24 other two- and three- 
way Interactions Of these, only three interactive terms reach conven- 
tional levels of statistical significance. The interaction of Extremity 
and Party Identrficaton has a coeffiaent of 1286 and a standard 
error of .314, indicating that strong partisans rely more on their own 
identification than weak partisans and independents, however, the 
results sought by Hypothesis 1 are robust to melusion or exclusion 
of this mteraction. The interactions of Favorabulity with Extremity 
and with Ambrvalence each have negative coefficients, indicating 
that independents, weaker partisans, and individuals with ambivalent 
partisan attitudes rely more heavily on affect toward the candidates 
than strong partisans or mdividuals with univalent partisan attitudes. 
18 Although Ambivalence and Knowledge are only minimally cor- 
related, there 1s greater variance in ambrvalence among the knowl- 
edgeable. Accordingly, ın computmg predicted probabilities in which 
Ambivalence and Knowledge are varied simultaneously, we use the 
following standard deviation values for Ambivalence’ 09 for low- 
Knowledge indrviduals and .18 for high-Knowledge individuals. 
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FIGURE 1. 


Party Identification 


0.80 


on the central cue of party identification. Self-identified 
Democrats with univalent party attitudes had an esti- 
mated probability of voting for the Republican House 
candidate of .27, compared to .78 for self-identified 
Republicans with univalent partisan attitudes. Mean- 
while, self-identified Democrats with ambivalent party 
attitudes have an estimated probability of voting for the 
Republican House candidate of .33, compared to .68 
for self-identified Republicans with ambivalent party 
attitudes, Partisan identifiers with ambivalent partisan 
attitudes display less loyalty to their own party’s can- 
didate than univalent copartisans; Hypothesis 1 is thus 
handsomely supported. 

Demonstrating the extent of empirical support for 
Hypotheses 2 and 3 requires illustrating three-way in- 
teractions. Generically, a three-way interaction states 
that the interactive effect of two variables varies across 
levels of a third variable. To explicate the complex 
multiplicative terms in Table 2, Figures 2 and 3 display 
predicted probabilities of voting for the Republican 
House candidate for two levels of Ideology and two 
levels of perceptions of Business Conditions, respec- 
tively. Figure 2 illustrates the interactive effects of po- 
litical knowledge and partisan ambivalence on a voter’s 
reliance on ideology. The first panel displays the pre- 
dicted probabilities in the low-partisan ambivalence 
state for knowledgeable and unknowledgeable voters 
separately; the second panel displays the predicted 
probabilities in the high—partisan ambivalence state. 
Since univalent voters should derive sufficient electoral 
confidence on the basis of party cues, the behavior 
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of knowledgeable and unknowledgeable voters should 
differ less in the low-ambivalence state than in the 
high-ambivalence state. This pattern is demonstrated 
by the shallow and nearly parallel slopes of both lines 
in the first panel in Figure 2: the difference between 
the predicted probability of voting for the Republi- 
can candidate for unknowledgeable liberals (.34) and 
that for unknowledgeable conservatives (.53) is 19 per- 
centage points; the difference between the predicted 
probability of voting Republican for knowledgeable 
liberals (.50) and that for knowledgeable conservatives 
(.63) is 13 percentage points. In the high-ambivalence 
state, illustrated in the second panel in Figure 2, only 
knowledgeable voters increase their reliance on ideol- 
ogy. The difference between the predicted probability 
of voting Republican for knowledgeable liberals (30) 
and that for knowledgeable conservatives (.73) is a 
hefty 43 percentage points, while the predicted prob- 
ability difference for unknowledgeable liberals (.38) 
and unknowledgeable conservatives (.46) is an anemic 
8 percentage points. Simply, individual differences in 
political knowledge stratify voters’ reliance on ideol- 
ogy, but only when the voter’s ambivalence toward the 
political parties is high, supporting Hypothesis 2. 
Figure 3 illustrates the interactive effects of politi- 
cal knowledge and partisan ambivalence on a voters’ 
reliance on economic perceptions. In contrast to the ef- 
fect of ideology, we observe that (as hypothesized) only 
unknowledgeable voters increase their reliance on eco- 
nomic perceptions as partisan ambivalence increases. 
The first panel in Figure 3 compares the predicted 
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FIGURE 2A. Ideology—Low Partisan Ambivalence 
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FIGURE 2B. Ideology—High Partisan Ambivalence 
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FIGURE 3A. Economic Perceptions—Low Partisan Ambivalence 
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FIGURE 3B. Economic Perceptions—High Partisan Ambivalence 


0.80 


d 
————— 
: 062 


Ambivalence, Information, and Electoral Choice 


probability of voting Republican as a function of eco- 
nomic perceptions for voters in the low—partisan am- 
bivalence state. (Recall that the economic perceptions 
variable has been coded in reverse for 1990 and 1992, 
so our usage of the terms “better” and “worse” must 
be reversed when thinking about a Republican presi- 
dency.) The difference in the predicted probabilities of 
voting for the Republican House candidate between 
voters who perceive the economy as improving and 
voters who perceived it as worsening is small: —4 per- 
centage points for unknowledgeable voters (implying 
that they are using economic perceptions erroneously) 
and +9 percentage points for knowledgeable voters. 
The second panel displays the predicted probability for 
voters with ambivalent partisan attitudes. A decrease in 
these voters’ political knowledge sharply increases vot- 
ers’ reliance on economic perceptions: the difference 
between the predicted probabilities (i.e., from worse 
to better) rises from +2 percentage points for knowl- 
edgeable voters to +18 percentage points for unknowl- 
edgeable voters. In other words, unknowledgeable vot- 
ers with ambivalent partisan attitudes are substantially 
more likely to engage in economic voting in House 
elections than those with either univalent partisan at- 
titudes or political knowledge. 

In summary, Figures 2 and 3 illustrate our two- and 
three-way interaction hypotheses and our thesis that 
knowledge differences between voters will induce het- 
erogeneity only when voters hold ambivalent attitudes 
toward the political parties. Consistent with Hypothe- 
ses 2 and 3, knowledge fails to stratify voters’ reliance 
on either ideology or economic perceptions when am- 
bivalence is low, in which case voters can reach con- 
fident electoral decisions largely on the basis of par- 
tisan cues and incumbency. Only when ambivalence 
is high, and partisan cues have become less reliable, 
does knowledge makes a difference. In this circum- 
stance, knowledgeable voters rely more on ideology, 
while unknowledgeable voters rely more on economic 
perceptions to achieve sufficient confidence in their 
electoral choices. 

We now address the question of how campaigns 
might alter the picture of heterogeneous voting that 
we have drawn and, in particular, how intense House 
campaigns change the voter’s calculus. Columns 4 and 5 
utilize the model of Column 2 but separate individuals 
into two categories, depending on whether the House 
campaign was low intensity or high intensity. To create 
these categories we use a three-part test. First, most 
scholars have argued that an important criterion of an 
intense race is that campaign spending be competitive; 
the typical measure of financial competitiveness is that 
the incumbent candidate should not spend more than 
double the amount of the challenger. We are not es- 
pecially interested in seats where the incumbent is so 
safe that he or she spends very little to defend the 
seat, so we require that, in addition to the ratio not 
exceeding 2:1, total spending must exceed a threshold 
level. Second, some scholars have viewed raw spending 
by the challenger to be an important criterion of com- 
petitiveness, regardless of the amount spent by the in- 
cumbent, particularly in light of Jacobson’s (1981, 1990) 
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contentions that incumbent spending matters less than 
challenger spending and that all spending is subject to 
diminishing returns. Krasno (1994) uses a threshold of 
$300,000 spent by the challenger in the 1988 House 
election; we use the $300,000 threshold for 1990 but 
then adjust the threshold up by 18% per election to 
account for the actual level of inflation in campaign 
expenditures in the 1990s. Third, intense races should 
also have a competitive result (closeness is potentially 
both a cause and a consequence of intensity). 

Taking these considerations together, we coded a 
campaign as competitive if (1) the margin of victory 
is less than 20 points and if either (2A) no candidate 
spends more than double his or her opponent, with 
total spending exceeding the threshold amount, or (2B) 
challenger spending exceeds the threshold amount. For 
the elections in our sample, this multistep criterion 
selects approximately 21% of districts as intense, on 
average 93 districts per election year.” As indicated 
by the N’s in Columns 4 and 5, four-fifths of survey 
respondents in our sample were not exposed to an in- 
tense House race. The results depicted in Column 4 
bear very strong resemblance to those in Columns 2 
and 3 in terms of the coefficients’ signs and their statis- 
tical significance. Therefore these results confirm that 
in the typical, low-intensity House race, voters who 
are ambivalent toward the parties reduce their reliance 
on party identification and increase reliance on either 
business conditions or ideology, depending on whether 
they are uninformed or high informed, respectively. 

Hypothesis 4 pertains to the effect of campaign in- 
tensity on voting behavior; we anticipate that voters 
will rely more heavily on ideology and less heavily 
on economic perceptions. This should manifest itself 
in Column 5 (in comparison to Columns 2 and 4) 
as larger coefficients on Ideology and Partisan Am- 
bivalence x Ideology and as smaller coefficients on 
Business Conditions, Partisan Ambivalence x Business 
Conditions, and Partisan Ambivalence x Knowledge x 
Business Conditions. The effects of partisan ambiva- 
lence are not expected to differ, although we might 
observe some of the action moving from the three-way 
interactions involving knowledge to the two-way inter- 
actions as the stratifying effects of knowledge diminish 
in these high-intensity races. A cautionary note before 
delving into the results: because so few campaigns are 
intense, our sample size is 735, barely 20% of our orig- 
inal sample, therefore the standard errors have dou- 
bled or tripled across the board; our attention is conse- 
quently oriented toward a comparison of coefficients. 
The expected patterns obtain in Column 5 (intense 
elections), in comparison to Column 4 (nonintense 


19 Of the 466 mtense races (of 2,207 ın our sample), only 58 races 
came in the 1990 election year—considered by Jacobson (2001) to be 
perhaps the least sumulating election in the past three decades—as 
opposed to 93 races in 1992, 119 races in 1994, 118 in 1996, and 78 
in 2000. Of the 466 races, 227 satisfied both campaign expenditure— 
based criteria, 174 satisfy only the second, and 65 satisfy only the first. 
Our test identifies 51.4% of open-seat races, 36.8% of incumbent- 
contested races with experienced challengers, and only 102% 
of incumbent-contested races with inexperienced challengers as 
intense. 
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elections). The coefficient on Ideology increases from 
1.62 to 2.90, and the coefficient on Partisan Ambiva- 
lence x Ideology increases from .17 to 8.36. Thus, the 
typical voter exposed to a high-intensity campaign finds 
it much easier to engage in ideological voting, although 
this is still primarily the choice of those who are moti- 
vated to do so by partisan ambivalence. Meanwhile, 
the coefficient on Business Conditions falls slightly, 
from .20 to .11, but the two-way interaction of Partisan 
Ambivalence x Business Conditions falls from 1.59 to 
—.09, and the previously significant three-way interac- 
tion of Partisan Ambivalence x Knowledge x Business 
Conditions switches signs and falls from —8.23 to 2.25 
(and is now nonsignificant). These results indicate that 
economic perceptions have no impact among the typ- 
ical voter exposed to a high-intensity campaign and, 
in particular, are no longer a cue relied on by high- 
ambivalence, low-knowledge voters. 

By stratifying campaigns according to their inten- 
sity, we have introduced, in effect, a rather complex 
four-way interaction, whereby the electoral impact of 
ideology now varies across levels of ambivalence, 
knowledge, and intensity. Our basic hypothesis was that 
voters will engage in ideological voting only when they 
are motivated to do so by their ambivalent attitudes to 
political parties, and only when their personal knowl- 
edge about politics or intense campaigns enables them 
to do so. Figure 4 illustrates this relationship by pre- 
senting the predicted probabilities of voting for the 
Republican House candidate for liberals and conserva- 
tives, at two levels of political knowledge, at two levels 
of campaign intensity; all other variables are held at 
their mean values. The first panel in Figure 4 depicts 
the behavior of voters with univalent partisan attitudes; 
comparing it to the first panel in Figure 2 reveals no 
especially interesting differences. The slopes of these 
lines for knowledgeable voters are consistently shallow, 
indicating a lack of ideological voting, although un- 
knowledgeable voters with univalent partisan attitudes 
do show some signs of ideological voting in high inten- 
sity campaigns. The second panel in Figure 4 depicts the 
behavior of voters with ambivalent partisan attitudes. 
Only one line has a shallow slope here: unknowledge- 
able voters exposed to low-intensity campaigns. Thus, 
an increase in either campaign intensity or personal 
political knowledge dramatically increases the impor- 
tance of ideology in the voter’s decision-making pro- 
cess. 


2 We also recomputed the effects of party identification and busi- 
ness conditions in high-mtensity and low-intensity campaigns, and 
could have included two additional figures that would correspond 
to Figures 1 and 3. In low-intensity campaigns, the differentiating 
effect of Party Identification falls from 52 among low—partisan am- 
bivalence voters to 37 among high-partisan ambivalence voters 
In high-intensity campaigns, its differentiating effect falls from 54 
among low-ambivalence voters to .44 among high-ambrvalence vot- 
ers As for Business Conditions, the differentiating impact is less 
than .06 for all but two types of voters. unknowledgeable voters with 
univalent partisan ambivalence m low-intensity campaigns (. 11) and 
unknowledgeable voters with ambivalent partisan attitudes in low- 
intensity campaigns ( 18). Tables of predicted probabilities or figures 
illustrating them are available on request 


CONCLUSION 


The findings in this paper are consequential for some 
of the fundamental questions of American political 
behavior. Under what conditions are voters satisfied 
to rely only on partisan cues? Under what conditions 
do individuals engage in ideological or issue voting? 
What are the effects of political campaigns? To ad- 
dress the first question, the least effort and sufficiency 
principles generate the theoretical expectation that cit- 
izens with univalent partisan attitudes derive adequate 
confidence in their electoral choices on the basis of 
partisan cues, while voters with ambivalent partisan 
attitudes will reduce their reliance on these cues. This 
hypothesis is consistent with the way ambivalence is 
typically studied (e.g., Lavine 2001), potentially yield- 
ing a pessimistic inference that political behavior is 
less predictable when tainted by ambivalence. The suf- 
ficiency principle sanctions an entirely new theoretical 
expectation, however, that because voters with am- 
bivalent partisan attitudes experience a discrepancy 
between actual and desired levels of confidence in 
their electoral decisions, they are motivated to expend 
additional cognitive resources to compensate for the 
reduced value of partisan cues. Only voters who are 
politically informed (i.e., knowledgeable or presented 
with an intense campaign) are capable of increasing re- 
liance on the highly diagnostic, prospective criterion of 
ideology. Meanwhile uninformed voters do not aban- 
don the quest for judgmental confidence when partisan 
ambivalence is high; rather they increase reliance on 
the easier-to-use, retrospective criterion of government 
performance. 

Our results, limited as they are to the House con- 
text, point to a larger conceptual analysis of the 
voter’s strategic flexibility. Prior conceptions of elec- 
toral decision-making characterized the American 
voter as either unable or unwilling to exert more than 
heuristic or low-effort thinking in the political realm. 
More recent thinking holds that informed voters rely 
on abstract considerations such as ideology, while un- 
informed voters fall back onto heuristics such as party 
identification, group affect, and the like. We find in- 
stead that the choice to turn away from party and 
toward ideology depends initially on the structure of 
a voter’s attitudes toward the political parties; voters 
for whom party labels generate little attitudinal conflict 
can and do rely confidently on those labels regardless 
of their political knowledge or the intensity of a cam- 
paign. Thus we conclude that partisanship is not merely 
a crutch for the politically uninformed, unsophisticated 
or unstirred by campaigns. 

Our results also clarify important consequences of 
political campaigns.) Campaigns that provide insuffi- 
cient stimulus leave voters to their own devices: some 
will rely mainly on partisan cues; some will have the 
motivation and knowledge to vote ideologically; others 
will engage in economic voting, providing little man- 
date to elected officials. Campaigns hold the promise 
of informing voters who possess little political knowl- 
edge of their own, but the impact of campaigns on in- 
creasing ideological voting depends on whether voters 
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FIGURE 4A. Ideology—Low Partisan Ambivalence 
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FIGURE 4B. Ideology—High Partisan Ambivalence 
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are motivated by their own ambivalent partisan atti- 
tudes to pay attention to campaigns’ policy content. 
Thus an observed absence of ideological voting does 
not automatically imply any fault on behalf of vot- 
ers or of campaigns. Rather, voters can be rationally 
ignorant—refusing to process information on ideolog- 
ical and issue positions yet still assuring that their deci- 
sions represent their values—so long as their attitudes 
toward political parties are clear-cut. 

The relationship between constituent preferences 
and representatives’ decisions has long caused anxiety 
among students of congressional elections (cf. Fiorina 
1974 and W. Miller and Stokes 1963); our demonstra- 
tion that ideology is potentially consequential in the 
minds of individual voters provides justification for the 
hypotheses that representatives’ position-taking can 
have electoral consequences (see too Canes-Wrone, 
Brady, and Cogan 2001). We find that ordinary indi- 
viduals’ involvement in the political decision-making 
process is more meaningful—disciplining elected offi- 
cials more effectively—when vigorous campaigns are 
waged. Likewise vigorous partisan competition at the 
national level clarifies party brand names, which—if 
this decreases partisan ambivalence—trelieves ordinary 
individuals of the burden of ideological voting while 
permitting them to participate meaningfully in pol- 
itics. In short, voter efficacy does not require vast 
amounts of political knowledge, but intense campaigns 
and clear partisan cues are necessary antidotes, con- 
firming Schattschneider’s (1960, 137) argument that 
“the people are powerless if the political enterprise 
is not competitive.” 
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When Moderate Voters Prefer Extreme Parties: Policy Balancing 


in Parliamentary Elections 
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is work develops and tests a theory of voter choice in parliamentary elections. I demonstrate that 

voters are concerned with policy outcomes and hence incorporate the way institutions convert 

votes to policy into their choices. Since policy is often the result of institutionalized multiparty 

bargaining and thus votes are watered down by power-sharing, voters often compensate for this watering- 

down by supporting parties whose positions differ from (and are often more extreme than) their own. 

I use this insight to reinterpret an ongoing debate between proximity and directional theories of voting, 

showing that voters prefer parties whose positions differ from their own views insofar as these parties 

pull policy in a desired direction. Utilizing data from four parliamentary democracies that vary in their 
institutional design, I test my theory and show how institutional context affects voter behavior. 


t the core of democratic theory is the notion 
A of competitive elections taking place at regular 

intervals. Citizens use elections as mechanisms 
by which they hold politicians accountable and express 
discontent, as tools for pointing in the direction they 
want policy to take, as means for placing issues on the 
public agenda, as an occasion for public deliberation, 
and as opportunities for choosing delegates or trustees. 
As Powell (2000) describes them, elections are instru- 
ments of democracy. 

How do voter preferences on issues translate into 
vote choice? To understand outcomes of any given 
election, one needs to sort out both the currency of 
the election—what voters care about—and how what 
they care about affects their choice. There is a gen- 
eral agreement among students of elections that is- 
sues matter (e.g., Barnes 1997). How they matter is 
unclear. According to current approaches, voters as- 
sess party positions (platforms) on the relevant issues 
with respect to their own views, employ some decision 
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rule (e.g., similarity between party positions and voter 
views), and evaluate parties based on this rule. How- 
ever, despite the voluminous literature on voter be- 
havior, as far as issues go, much of the cross-national 
and cross-individual variation in voter behavior is left 
unexplained; political scientists disagree about which 
theoretical approach best explains how preferences on 
issues affect vote choice (e.g., Iversen 1994a and Lewis 
and King 2000). 

Would we imagine voters employing the same 
decision rule in, say, the 2001 British elections, where, 
as expected, the Labour Party alone secured a solid ma- 
jority in the House of Commons, as in the 2003 Israeli 
elections, where 13 parties gained seats in the Knesset, 
four of which hold cabinet positions at the time of 
writing? More generally, do voters in majoritarian 
systems employ the same strategy as their counterparts 
in consensual democracies? Under current scholarly 
frameworks, the answer is a clear yes. Focusing on 
voter evaluation of party positions, current frameworks 
of issue voting imply that postelectoral bargaining is of 
little importance. If platforms are indeed the object 
of voter evaluation, then the path from votes to seats 
to government formation to legislation to policy—and 
therefore much of what is regularly considered as the 
heart of politics—is inconsequential for voter assess- 
ment of parties. Thus, if voters evaluate parties based 
on their platforms, we should expect the principles 
that underlie their choices to hold across institutional 
environments regardless of the length and features of 
the path leading from votes to policy. The unexplained 
variation in voter behavior across democratic systems, 
however, suggests, that, possibly, something else is at 
work. 

When a party wins election in majoritarian system, 
it can usually implement its preferred policy with little 
compromise; when Blair and his party are in power, 
they are in power. In contrast, according to standard 
portrayal of a victory in consensual systems, the win- 
ning party faces a long and winding path leading from 
election results to policy, with bargaining and compro- 
mise awaiting at each turn. If voters are at all concerned 
with policy outcomes, then the path from the announce- 
ment of election returns to policy is of much relevance 
to their choice. They might prefer one party when 
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they predict that their vote will be watered down by 
institutionalized bargaining and a different one when 
they predict a short, straight, path from votes to policy. 
How voters’ opinions on issues affect their vote 
choice is critical for our understanding of representa- 
tion. In many systems we observe ideological discrep- 
ancy between parties and their constituencies, with the 
former often being more extreme than the latter on is- 
sues (e.g., Adams and Merrill 1999 and Iversen 1994b). 
Does it imply that party elites do not listen to vot- 
ers’ voice? Or perhaps they do, and this discrepancy is 
consistent with voter preferences. Perhaps voters, pre- 
dicting their vote to be watered down along the path, 
prefer parties to hold positions more extreme than 
their own opinions. This discrepancy between voter 
opinions and party positions on issues may or may not 
be a concern, depending on the decision rule voters 
employ when evaluating party positions on issues. If 
voters are concerned with platforms, ideological incon- 
gruence between voters and parties raises a potential 
concern of deficient representation. If, however, voters 
are concerned with policy outcomes, they might prefer 
parties’ positions on issues to differ from their own 
views, and therefore this discrepancy is of less concern. 
The answer one offers to the question posed above 
holds implications not only for our understanding of 
voter behavior and of questions of representation, but 
also for our practice as students of electoral politics. If 
current frameworks hold empirically, and voters evalu- 
ate party positions vis-a-vis their own views, then voter 
behavior will be the same when the rules of the game 
vary. However, if voters are concerned with policy out- 
comes, endorsing parties that pull outcomes in their 
direction and shunning those that push outcomes away 
from them, they might take into consideration the in- 
stitutional mechanisms that convert votes to policy, and 
so their choice will depend on how much they expect 
their vote to be watered down by these mechanisms. 
In other words, voter behavior will reflect expectations 
about postelectoral bargaining in the legislature. If this 
is the case, to understand voter choice with regard to 
issues, political scientists ought to move beyond the 
institutional/behavioral dichotomy and incorporate in- 
stitutional context into the study of behavior. 


ACCOUNTS OF VOTER BEHAVIOR: CHOICE 
OVER PLATFORMS 


The spatial model (Downs 1957; Hotelling 1929) char- 
acterizes party competition in a two-party system 
where the winner can implement his or her policy plat- 
form. Under this system, each voter chooses a party 
such that the outcome is spatially closest to his or her 
own bliss point. The result of this model is well known; 
the two parties will adopt the policy position of the 
median voter. However, in the Downsian world, a two- 
party system where the winner can implement his or 
her preferred policy, policy outcome is identical to the 
winner’s position (assuming a binding platform), and 
thus the same result can be achieved by either voting 
over policy outcomes or voting for the party whose 
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position is the most similar to the voter’s position. Em- 
pirical research adopts the latter motivation—voting 
over platforms—and formulates voter utility for par- 
ties as negatively related to the distances between the 
voter’s and the parties’ issue positions (Enelow and 
Hinich 1984). 

With a few exceptions, the vast empirical literature 
holds that voter evaluation of parties depends on party 
positions on issues (platforms). This literature revolves 
around a major point of contention: whether voters 
prefer parties whose platforms are most similar to their 
own or parties whose platforms are more extreme than 
their own. Proponents of Proximity Model argue that 
voters prefer parties that are ideologically similar to 
their own views on the issues (e.g., Blais et al. 2001 
and Westholm 1997). Disparately, supporters of the 
Directional Model argue that voters prefer parties that 
are ideologically in the same direction as, but are more 
intense than, their own views on the relevant issues 
(e.g., Macdonald, Listhaug, and Rabinowitz 1991 and 
Macdonald, Rabinowitz, and Listhaug 2001). Despite 
the conviction of scholars within each camp, numerous 
studies comparing the explanatory power of the two 
models in accounting for voter choice find mixed evi- 
dence (e.g., Cho and Endersby 2003, Lewis and King 
2000, and Pierce 1997). 

In the heated debate between the two camps, 
Downs’s original argument of outcome-oriented voting 
has been set aside. But as Bailey (2001, 6) reminds us, to 
the extent that a voter wakes up in sweat in the middle 
of the night thinking about politics, it is policy outcomes 
and their immediate effect that are on his or her mind. 
According to Downs, the voter “weigh(s) the perfor- 
mance that the opposition party would have produced 
in period t if it had been in power” (40).' He or she 
then engages in a counterfactual thought-experiment 
comparing the utility he or she would receive under the 
two alternative scenarios: 


... [T]he most important part of a voter’s decision 1s the 
size of his current party differential, i.e., the difference be- 
tween the utility income he actually received in period t 
and the one he would have received if the opposition had 
been in power. (40, emphasis ın the original) 


In the Downsian two-party setup this logic translates 
into a difference between voter utility from having 
party A in office and voter hypothetical expected 
utility from having the opposition party, B, in office: 
UA — E(U®), where t is the current period (40). While 
still assuming a winner-take-all setup, Downs extends 
this decision rule to multiparty systems: “In the latter, 
the voter follows the same rules as in the former, but 
compares the incumbent party with whichever of the 
opposition parties bas the highest present performance 
rating, i.e., would yield him the highest utility income 
if it were now in office” (47).? 


1 For a more extensive discussion, see Downs’s (1957) account of 
“The Logical Structure of the Voting Act” (38-49). 

2 Building on Downs, Grofman (1985) follows a similar logic. He 
offers a model of voter choice based on “directionality and magni- 
tude of expected shifts from the status quo” (abstract). In Grofman’s 
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In the following section I present a model in line 
with the (neglected) Downsian notion of outcome- 
oriented behavior. Contrary to the voluminous empir- 
ical literature, I contend that even when focusing on 
issues, voter behavior is largely policy-oriented. Un- 
der my framework, the rationale for preferring a party 
whose positions are different from (and not necessarily 
in the same direction as) the voter’s views is policy 
oriented—voters endorse parties insofar as the parties 
pull policy outcomes in a desired direction. 


VOTING TO ACHIEVE OUTCOMES: A MODEL 
OF INSTITUTIONAL BALANCING 


Intultion 


“What Would It Look Like Without Them.” Imagine 
a German voter sitting at home, watching the evening 
news, and asking him- or herself: What would it have 
looked like if nobody cared about the environment? 
What would it have looked like without the Greens? 
Do the Greens pull policy in my direction or away from 
it? He or she then evaluates the party’s impact on policy 
by comparing policy to a counterfactual policy, the pol- 
icy we would have observed were the Greens absent 
from the policy-formation process. The voter rewards 
the Greens if they pull policy toward his or her position 
and penalizes them if they pull it away. 

A policy outcome that best represents a voter’s issue 
position in a parliamentary system can be produced— 
depending on the institutional environment—either by 
a single party in the parliament holding the same po- 
sition as the voter or by compromise among multiple 
parties in parliament. In the case of the latter, outcome- 
oriented voters may prefer a party to their right (left) 
if the center-of-gravity in the parliament is to their left 
(right) to a party whose position is identical to their 
own. Other things equal, a party placed exactly at the 
voter’s position will be less effective in balancing out 
any force than a party at the opposite side of the force 
with respect to the voter. 

Similar to the German voter, we can imagine a 
moderate-left Swedish voter endorsing the Left Party 
not necessarily because she hopes for socialism to guide 
policy but, rather, out of concern that “if the Social 
Democrats are in power, they will coalesce with the 
Center Party and privatize, privatize, privatize, pri- 
vatize....” Indeed, coalition constraints and the need 
to rely on legislative support of centrist parties often 
“lock” the two major parties from pursuing divergent 
policies and serve as an opportunity for voters to sup- 
port extremist parties. Kitschelt (1995) describes this 
pattern as one of the forces leading to the strength- 


model voters examine “how successful potential officeholders are 
likely to be ın implementing changes from the status quo in the direc- 
tion they mtend” (230) and make their choice accordingly. Grofman 
sets an inequality whereby a voter would prefer party L to party R iff 
L shifts the status quo closer to his or her ideal pomt than party R does 
(233, Eq. 1). Each party in the model has a performance weight, and 
it is predicted to shift the status quo m the directon of its platform 
according to its performance weight. 
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ening of the Danish Progress Party and its Norwegian 
counterpart in the 80s (126-27). 

The incentive for moderate voters to vote for ex- 
treme parties is magnified under coalition govern- 
ments, as well as under other institutionalized mech- 
anisms of power-sharing such as minority governments 
and opposition control of committee chairmanship. It 
potentially leads to what I refer to as compensatonal 
voting; in order to compensate for the watering down 
of their vote by the institutional power-sharing, when 
facing a powerful Right, left-leaning voters may be 
more likely to vote for the extreme Left even if their 
positions are closer to the moderate Left. However, 
in the case of a single-party government, outcome- 
oriented voters might be better off voting for the party 
whose positions are most similar to their own views and 
so represents them best; they have less of an incentive 
to compensate by “overshooting.” I refer to this as rep- 
resentational vote. Generally, other things equal, the 
more power-sharing allowed by the electoral system, 
the greater the incentive for voters to prefer a party 
whose position is different from their own. The strat- 
egy underlying voter behavior, then, is institutionally 
dependent. 

The counterfactual thought-experiment described 
above and the evaluation of parties’ marginal impact 
in the Compensational Vote Model follow a logic sim- 
ilar to the counterfactual alluded to by Downs and 
Grofman. Once modeled explicitly, embedded in insti- 
tutional context, and applied empirically, it provides 
leverage for understanding cross-system regularities in 
voter behavior unaccounted for by current theories. I 
turn now to presenting these layers in detail. 


Assumptions 


The model is decision theoretic. Although voters under 
my framework are forward-looking, they do not nec- 
essarily possess knowledge or expectations regarding 
what all other voters choose and coordinate their be- 
havior accordingly. This setup is reflected in both the 
theoretical modeling and the empirical analyses that 
follow. In addition, the model (as well as the intuition 
above) is in one dimension. Neither the theoretical 
results, nor the empirical ones, however, hinge on its 
being unidimensional.? 

The model relies on three additional assumptions re- 
garding information voters possess. First, I assume that 
voters hold positions on the relevant issues. Second, 
I assume that they have a perception of parties’ posi- 
tions. The perception, however, need not be “correct” 
(Westholm 1997). Finally, I assume that voters hold 
a belief about the p and nature of power 
sharing—a belief as to whether the party winning the 
prime ministry will be able to govern alone or will need 
to bargain with others—as well as about the distribu- 
tion of power among parties. My empirical analysis 


> This assumption can be relaxed without a quahtatrve change of the 


results. It is possible, for example, for a voter to vote for the party 
spatially closest to him or her on some dumensions but employ a 
compensational strategy on other 
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below examines voter strategy in four polities. Focusing 
on two of the cases, I examine the plausibility of this 
last assumption. 

Leading a majoritarian system in which a coalition 
has not emerged since the war, a British prime minis- 
ter can implement his or her party’s preferred policy 
almost as is; there is little pulling by the opposition 
parties.* Norway, on the other hand, is a proportional 
system where the vast majority of elections result in 
either coalitions or minority governments (see Strøm 
and Leipart 1993). British voters observe a single-party 
government election after election, while for Norwe- 
gians bargaining in the Storting is a norm. Signals to 
voters, then, are clear; Norwegian or British voters 
need not be highly informed or especially sophisticated 
to realize whether power-sharing is likely to emerge 
after the elections. 

As for the distribution of power among parties, in 
the case of consensual democracies, voters, I assume, 
hold predictions (or behave as if they hold ones) about 
the nature of the distribution of power and the coali- 
tion that is likely to emerge. To capture what these 
predictions might be, I employ alternative measures 
of party impact on policy, which serve as proxies for 
power distribution and for coalitions that might evolve, 
accounting for different ways in which voters might 
perceive the composition of power. 


The Compensational-Vote Model 


Under the proximity model, the utility of voter i (i = 
1,...,m) for party jG =1,...,7) is inversely related 
to the ideological distance between i and j. In one 
dimension, 


U; = —Bi(v, —p,). (1) 


where v, is the ideal point of voter i, p, is the position 
of party j, and £; is an unknown constant. 

As mentioned above, representation or expression 
of opinions is only one motivation for choosing one 
party over another. Voters might also use their vote 
to shift policy outcomes toward their ideal points. Ac- 
cording to this logic, they reward parties that pull out- 
comes in their direction and penalize parties that pull 
outcomes away from them. How do voters perceive 
political outcomes? One possibility is that voters utilize 
a “naive understanding of democracy” in which policy 
outcome is a weighted average of policy positions of 
parties in the legislature, where the weights are the 
relative impacts of the different parties. Policy outcome 
is then calculated by 


Pa) sp, (2) 


jal 


where s, is the relative impact of party j, such that 
Zas =1 and s, € [0, 1) V,. The relative impact of 
each party depends on the distribution of power in the 


4 The Lib-Lab pact in 1977-78 is an exception. 
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parliament, and in particular, on the composition of the 
parliament, the composition of the governing coalition, 
and portfolio allocation within the government. While 
theories of bargaining and coalition formation are not 
within the scope of this study, I address this issue here 
briefly. 

One might wonder why I allow all parties in the 
legislature to have an impact greater than zero. After 
all, some parties are highly unlikely to be members of 
the government. There are several motivations for not 
completely discounting the role of small nongovern- 
mental parties. First, in some cases opposition parties 
are often partners in ad hoc pacts with factions of 
the governing coalition and thereby get part of their 
agenda implemented. Second, in some systems (e.g., 
Norway), the governing coalition is often a minority 
government, such that it structurally depends on fac- 
tions of the opposition supporting it (Strøm and Leipart 
1993). Third, some parliamentary systems allow the 
Opposition to have partial control over policy forma- 
tion via authority in permanent legislative committees 
(including committee chairmanship), conferences be- 
tween parliamentary leaders, and other mechanisms 
(Strøm 1990, 207-11). Finally, even uninvolved in the 
policy process itself, opposition parties affect policy 
outcomes by shaping the discourse, placing issues on 
the agenda, and forcing established parties to address 
these issues. (See Avakumovic 1978 for discussion of 
the effect of the CCF-NDP in Canada and Meguid 2002 
for the impact of rising parties in western Europe.) 

When assessing each party, outcome-oriented voters 
may entertain a counterfactual: how did politics look in 
the old days when nobody cared about issue X? Utility 
from policy-motivated voting is then represented by 


U, = Boi (v1 — P_p, y-M- PY’), (3) 


where fz is an unknown constant and P_p, is a counter- 
factual policy outcome—an outcome that would have 
been produced had all parties except party j taken part 
in the policy-formation process: 


1 
Poo, = | 55, | sere (3a) 
iy kA 


The intuition behind the bracketed term in Equa- 
tion (3) is a counterfactual analysis in line with Downs’s 
(1957) counterfactual discussed above (40). If party j 
pulls the outcome closer to the voter, this term is posi- 
tive. If j pulls it away from the voter, it is negative. The 
voter’s utility for party j approaches maximum when P 
approaches the voter’s bliss point and P_,, is far from 
it. Since the model describes an outcome-oriented yet 
naive voter, I assume that (from the voter’s point of 
view) in j’s absence other parties do not relocate to fill 
the “vacuum,” nor do their impacts change relative to 
one another. Note that this formulation does not imply 
that the voter believes the impact of his or her partici- 
pation in the elections to be the presence or absence of 
the party from the map. Rather, he or she is concerned 
with the impact of the party. P — P_p, is the marginal 
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impact of party j in policy space, and hence 


[vi — Pp) — v- PY] 


is the marginal benefit of party j for voter i. 

I incorporate the two motivations and normalize 
ßı + $ to 1. In addition, I include a parameter (6) 
that indicates the salience of issue voting? and allow 
for individual-level background variables such as so- 
cioeconomic background to affect the utility for each 


party in a different fashion: 
U; = 6{—B(y, — Pj y’ = (1 = BI — PY 
=(= Pp, J) + 2), (4) 


where £ e (0, 1) is a relative weight of the two compo- 
nents of voter utility such that the more proximity-lead 
is voting, the larger is 8. Finally, 6, is a vector indicating 
the effect of background variables z, on voter utility 
for party j. 

Interpreting £ is key, and while voter behavior is 
represented as a combination of two motivations with 
p as a mixing parameter, examining the two extremes 
is helpful for interpretation of the entire range. If vot- 
ing for a party whose positions differ from those of 
the voter bears no psychological/representational costs 
such that voting over policy outcomes is the sole con- 
sideration in mind as far as issues are concerned, f 
will be arbitrarily close to 0, and so the vote will be 
almost entirely compensational. When the proximity 
component is dominant £ will approach 1, and depend- 
ing on the institutional context, two interpretations are 
possible. An empirically large 6 can be an indication 
of a voter expressing his or her views and thus voting 
for the party closest to him or her (representational 
vote), ignoring considerations of power-sharing and 
hence the postelection policy-formation process. Alter- 
natively, in majoritarian systems with little bargaining 
in the policy-formation process, a large $ can result 
from policy-oriented vote (as only little compensation 
is needed). 

To illustrate the calculation in Equation (4), imagine 
a three-party legislature with parties A, B, and C (s4 = 
1—sg—Sc). In this case, P_p, = [ss/(sp+5c)|pa + 
[sc/(Sp + Sc)|pc, and by substituting this into Equa- 
tion (4), voter i’s utility for party A can be expressed 
as 


Ua = —B(v, — pa)’ — (1 — Al -PY 
— (v: — Pip, VI} + 28a 
= — 6(v, — pa) -X1 — P) 
x (vı — SAPA — SBP B — S&P c) +1 — P) 


2 
SB SC 
x {y — ee re 
(» aon re) +264. (5) 





5 In a multicrmensional setup with d dimensions, 6 1s a d x 1 vector 
where each element represents the salience of a certain issue area. 
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Calculating the utility for party B as in Equation (5) 
and taking the difference between the two gives the 
net utility of voting for A versus B: 


U,,A-B = Ua — Usp 
= [v — ps)’ — (vi — pay’) 
+41 — Pi — Ppa) — Vi — P_pp)’] 
+ z,(54 — 8B). (6) 


Differentiating Equation (5) with respect to p4 and 
setting the result to 0, we get the optimal placement of 
pa for voter i (second-order conditions established in 
Appendix I): 


x B(sa—1)—Sa 


py (1 — B)s4 (Sapp +Scpc) 
a *B(4A 1-9, 


B(s,—1) -=s 
(7) 


For a clearer intuition of the solution, consider the two 
extremes. When 8 — 1 (representational voting), not 
surprisingly, the prediction of the model reduces to the 
proximity prediction: 


PA(8) — a i vi. (7a) 


When £ — 0 (compensational voting) it reduces to 
; v, — (SBPB + ScDc) 
> 7b 

PAP) 7 Sa (7b) 


That is, when voting is purely compensational, the ideal 
placement of party A is the mirror image of policy 
outcome produced by the combination of parties B 
and C alone weighted by the impact of party A. Other 
things equal, the less powerful party A is (the smaller 
the denominator in Equation [7b]), the farther away 
it has to locate in order to shift policy outcome in its 
direction. In addition, holding party impact constant, 
the more extreme parties B and C are, the more ex- 
treme the voter would like party A to locate in the 
opposite direction with respect to his or her views in 
order to balance the other two parties. In the extreme 
case of a 8 arbitrarily close to 0, the ideal placement 
of party A is the point in the policy space that yields 
a policy compromise P identical to the voter’s bliss 
point. Given p4 (Equation [7b]), that compromise is 
expressed by 


v, — (Sapa + Sepe) 
SA 


+sapp+$cPc =. (7c) 


Plp+0pamps = SA 


When £ is marginally close to 0 (i.e., voting is al- 
most entirely compensational), voter utility still peaks 
at a point different from his or her own views and 
then declines) When a party is too extreme or pulls 
policy “too much” the benefit for the voter declines. 
Therefore, a voter’s decision to vote for a party 
that is not necessarily ideologically most similar to 
him or her is moderated by an endogenous feature 
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of my model. Even a purely compensational voter 
does not employ a “the more extreme the better” 
logic. 

The mixing parameter represents a summary of a 
nuanced political reality. In both the theoretical model 
above and the empirical model below, individuals vary 
in the extent to which they vote out of representa- 
tional or compensational considerations insofar as the 
institutional context in which they vote varies. Under 
this specification $ captures the extent to which voter 
choice in a given system is motivated by compensa- 
tional or representational considerations; the model 
does not allow for heterogeneity among voters in the 
strategy they employ. Nonetheless, in addition to vari- 
ation across institutional environments, voter charac- 
teristics within an institutional environment may af- 
fect the strategies employed. J can be interpreted, 
then, as an average of voter strategies in a given 
system. 

In addition to a summary of multiple voters, the mix- 
ing parameter can be thought of as a summary of multi- 
ple selves. As discussed above, each individual carries 
conflicting considerations; while advancing beneficial 
policy outcomes, compensational voting for a party 
whose position differs from the voter’s position can 
be psychologically costly, and conversely, while carry- 
ing psychological representational benefits, proximity 
voting does not necessarily advance one’s policy inter- 
ests. The two motivations often translate to conflict- 
ing strategies. The mixing parameter, then (even when 
estimated at the individual level), also represents the 
relative extent to which an individual’s choice is guided 
by compensational versus representational considera- 
tions. 


FROM INSTITUTIONS TO VOTERS: 
OBSERVABLE IMPLICATIONS 


The core of the model leads to my empirical predic- 
tions. If, in addition to party platforms, voters are con- 
cerned with policy outcomes, they will not necessarily 
vote for the party whose positions are most similar to 
their own positions, but rather, they will compensate 
for postelectoral bargaining resulting in watering-down 
of their vote and will often prefer parties whose posi- 
tions differ from their own. Indeed, as I discuss above, 
because opposition parties affect policy indirectly by 
placing issues on the agenda, even in a hypothetical 
case of pure majoritarian regime I expect to find voters 
employing a mixed decision rule. 

How does the argument depend on institutional en- 
vironments? In my analysis, voters vote over parties 
available on the ballot but are concerned with policy 
outcomes. The conversion mechanism from parties on 
the ballot to policy outcomes varies greatly by insti- 
tutional context. In some institutional environments 
political bargaining is an everyday matter, while in 


6 Unlike in the Directional Model, where the moderation depends 
on the exogenously posed constraint, the region of acceptability, 
moderation 1s an endogenous feature of the Compensational Vote 
ModeL 


190 





May 2005 


FIGURE 1. From Institutions to Voters 


Be (0, 1): 


others the winner can implement his or her ideal pol- 
icy with little compromise. Figure 1 summarizes the 
institutional component of my argument. When insti- 
tutionalized bargaining takes place voters compensate 
for the watering-down of their vote by voting for a party 
whose positions are different from, and are often more 
extreme than their own ideal points. Since voters utilize 
their vote to affect policy outcomes, the more power- 
sharing allowed by the institutional environment, the 
more voters will compensate for the watering-down 
of their vote by voting for parties whose positions 
differ from their own positions. Therefore, empiri- 
cally, I expect B to decrease with institutional power- 
sharing. 

This prediction also observationally distinguishes my 
framework of outcome-oriented behavior from exist- 
ing theories of issue voting. If voters are concerned with 
either proximity or direction of platforms, the same 
regularities should hold irrespective of post-electoral 
bargaining. If, on the other hand, voters vote to achieve- 
preferred policy outcomes, their taste for compensa- 
tional versus representational strategies will vary with 
institutional context as specified above. 

Compensational voting is observationally distin- 
guishable from directional voting in three additional 
ways. The model predicts that the extent to which vot- 
ers employ compensational strategy depends on insti- 
tutional context. However, it is likely that the taste fora 
particular voting strategy also varies across individuals 
within a given system. Indeed, allowing for individual- 
level heterogeneity in £, I demonstrate that in the 
Netherlands, the higher the level of education and the 
weaker the attachment to a party, the more compen- 
sational is one’s vote. Behavior of those less educated 
and strongly attached to a party is more likely to fol- 
low principles of proximity voting (Kedar 2003).’ Con- 
versely, Macdonald, Rabinowitz, and Listhaug (1995) 
predict that compared to proximity voting, directional 


7 The average of strategies across individuals is similar to the results 
reported here. 
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voting will decrease with political sophistication.’ The 
divergence of the two predictions is not surprising; 
prompted by symbols, directional voting is emotion- 
ally driven and places modest cognitive requirements 
on voters compared to proximity voting (456). Com- 
pensational voting, on the other hand, while placing 
relatively modest informational demands on voters as 
specified in the assumptions above, is still more de- 
manding than proximity voting. 

In addition, under the Compensational Vote Model, 
since voter utility for each party—and hence voter 
choice—depends on predicted outcomes, it depends 
on the configuration of other parties in a way that it 
does not under directional voting. As discussed above, 
a Left-leaning voter might prefer to endorse the ex- 
treme left when predicting an outcome to his or her 
right but might endorse the moderate left (or even a 
party to his or her right) when predicting that “over- 
shooting” is not necessary (or is even necessary in 
the opposite direction). Finally, compensational voting 
does not specify a neutral point that (in one dimen- 
sion) divides the ideological continuum into two, and 
therefore, unlike under directional voting, centrist vot- 
ers will not necessarily be indifferent among parties, 
in particular, between moderate and more extreme 
parties. 


EMPIRICAL ANALYSIS 


Selection of Cases and Data 


With the institutional hypothesis in mind, I select my 
cases. [ examine four parliamentary polities that vary 
in their institutional design: Britain, Canada, Norway, 
and the Netherlands. The former two represent majori- 
tarian systems, while the latter two represent consen- 
sual systems. Obviously, the four vary institutionally in 
different ways. Deducing differences from principles 
of majoritarianism and power-sharing, Lijphart (1984) 
famously specifies 10 indicators that cluster into two 
dimensions of institutional features of democracies, 
offering a subtle classification of democratic systems 
(see also Powell 2000). Four cases do not allow me to 
parse out the effects of the nuanced institutional mech- 
anisms. However, examining the electoral systems in 
these polities and the way they score on Lijphart’s Ex- 
ecutive Parties dimension indicates clearly that Britain 
and Canada are highly unitary, while the Netherlands 
and Norway are highly consensual. 

I utilize surveys conducted by the British Election 
Study (Heath et al. 1987) and the Norwegian Elec- 
tion Study (Aardal and Valen 1989), as well as the 
Comparative Study of Electoral Systems survey con- 
ducted in Canada (1997) and the Netherlands (1998; 
The National Election Studies 1995—98). Also, I utilize 
data about the results of these elections: vote shares, 
distribution of seats in the parliament, and portfolio 
allocation in government. 


8 Contrary to their prediction, the authors find that directional voting 
dammates prommity voting regardless of voter sophistication 


Vol. 99, No. 2 


In addition to the institutional variation, the cases 
exhibit variation on an additional dimension. As men- 
tioned above, issue voting has become a more signif- 
icant determinant of voter choice in many Western 
democracies. The selection of cases from both the 1980s 
and the 1990s allows me to at least partly control for this 
change, allaying the concern that voter employment of 
certain strategies depends on the overall importance of 
issues. 


Measurement 


Question wording for each of the surveys employed 
is available at the author’s URL (see above). Opera- 
tionalization of two concepts—party position and party 
impact—imerits a separate discussion. 


Party Position. ‘The choice of measure of party po- 
sition relies on both theoretical and empirical consid- 
erations. Since voters are the focus of this study as 
well as of the theories from which this study departs, I 
conduct most of the analysis measuring party position 
as perceived by the individual voter (measured in the 
relevant survey). This procedure follows Blais et al. 
(2001), Westholm (1997), and many others. In particu- 
lar, Westholm (1997) writes, “Although voters may at 
times be mistaken about these locations, it is their per- 
sonal beliefs... that will guide preference formation” 
(870).1° 

Still, the skeptic might argue that using perceived 
party positions may bear the risk of projection bias; to 
the extent that voters tend to perceive a party closer 
to their own position on an issue when they support 
the party, the results might overstate support for the 
Proximity Model. However, accounting for such sup- 
port by including control variables in the estimation 
(such as union membership, which is likely to make 
one feel close to the Labour Party; church attendance, 
which fosters closeness to the Christian Democrats; 
and the like), I reduce the risk of projection bias (see 
also Blais et al. 2001 for a similar argument). Finally, 
under the worst-case scenario, to the extent that pro- 
jection bias still exists, using this measure simply means 
a conservative test for my theory: the empirical anal- 
ysis is less likely to yield support for my model and 
more likely to support the restricted model—the Prox- 
imity Model. Overall, the analysis of multiple cases 
mitigates methodological concerns often mentioned in 
the proximity—directional debate; in line with previous 
studies (Merrill, Grofman, and Adams 2001), I have 
neither theoretical nor empirical reason to suspect that 
my methodology biases the results in different direc- 
tions across the four systems in a way correlated with 


? For simplicity, the results, for Canada below include 
all provinces with the exception of Quebec and therefore the Bloc 
Québécois. However, both an analysis of voting behavior in Quebec 
and a nationwide analysis produce results similar to those reported 
here 


10 In addition, Blais et al. (2001) write about the use of aggregate 
measures of party position: “This does not make theoretical sense 
because there is no reason to believe that voters react to an aggregate 
score of which they are unaware” (85). 
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institutional design." Nonetheless, also I conduct some 
empirical tests utilizing average perceived party place- 
ment as a measure of party ideology, as well as a set 
of analyses using thermometer rating toward parties as 
the dependent variable. 

As the theoretical model, the empirical analysis, 
too, is unidimensional. In three of the cases (the 
Netherlands, Canada, and Norway) the survey offers 
a general left-right placement scale.“ In the fourth 
case, Britain, the survey offers seven items for self- 
placement instead. However, in the British election 
under study (1987) the three parties align in the same 
order across all seven items, and so reduction of the 
analysis to one dimension is possible. 


Party Impact. Since voters are the focus of this study, 
the measures of party impact on policy employed is 
a reflection of voter perception of patty impact. As I 
mention above, although important in itself, theoreti- 
cal accounts of parhamentary bargaining, party impact, 
and intracoalition bargaining are within the scope of 
this study. Yet, to establish the robustness of my find- 
ings and to allay a potential concern that my results 
are an artifact of measurement decisions, I conduct 
my analysis using three alternative measures of party 
impact. I describe these measures below. 

Following the naive view of democracy, I first use a 
simple (probably the simplest) approximation of party 
impact that voters might entertain—-party seat share in 
the parliament. It is often the case, however, that public 
opinion polls prior to the elections report the expected 
popular vote rather than the expected seat-share, and 
thus voters’ expectations regarding the results of the 
election are based on these reports. The second mea- 
sure I employ, then, is the actual popular vote, which, I 
assume, is a proxy for the average public opinion poll 
prior to the elections. 

Although all members of the legislature have 
some impact on policy formation, members of the 
opposition—it might be argued—are not as influential 
as their colleagues in the coalition, even controlling for 
the number of seats they hold. Similarly, parties that 
hold the lion’s share of portfolios may be more power- 
ful than junior partners in the governing coalition. 

Unfortunately, I could not find relevant surveys that 
ask voters for their prediction of the coalition that 
will emerge after the election. However, by averag- 
ing seat-share in the legislature with portfolio-share in 
the government I assign seat-share different weights, 
depending on whether the party is in the governing 
coalition or not. Parties in the opposition score 0 on 
the portfolio scale and thus their seat-share is down- 


1 In fact, in a study comparing voter placement of party positions 
in the United States, France, and Norway, Merrill, Grofman, and 
Adams (2001) report no systematic differences across the three sys- 
tems. (The authors note that Republicans perceive the Democratic 
candidats to be substantially more liberal than the Democrats do, 
and that this bias us greater than all other comparisons in their study, 
but their findings do not suggest any systematic difference across the 
three systems.) 

12 Indeed, m the first two this 1s the only available question of ideo- 
logical placement. 
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weighted, while parties in the governing coalition have 
their seat-share in parliament weighted more heavily, 
and more so the more senior in the government they 
are. The results reported below are based on an aver- 
age of the two components with a 3:1 ratio.’4 Given the 
often undervalued potential effect of the opposition 
(see discussion above and Avakumovic 1978, Meguid 
2002, Strøm 1990, and Strøm and Leipart 1993), this av- 
eraging takes into consideration the direct and indirect 
influence of the opposition on policy formation. 

In employing these measures I implicitly suggest 
that the institutional procedure leading to outcomes 
is similar across these polities. This is, of course, not 
the case. As I discuss above, numerous studies suggest 
that the executive has more impact vis-a-vis the legis- 
lature in Britain, for example, than in the Netherlands 
(Doring 1995; Lijphart 1984). Indeed employing mea- 
sures that incorporate procedural differences will pull 
the results in my favor, running the risk of employing 
a measure that assumes the answer. Given the theo- 
retical argument in this study, then, I conservatively 
employ identical measures of policy formation across 
polities. 

Notice that while the model is decision theoretic, 
incorporating parties’ expected impact into voter cal- 
culation implies an interaction among voter actions. 
The impact voters attribute to different parties, and 
hence their utility for each party, takes into consid- 
eration their beliefs on others’ behavior. Once party 
position and impact are measured, policy, as well as 
counterfactual policy, can be calculated as illustrated 
in the previous section in Equations (2) and (3a), re- 
spectively. 


The Statistical Model 


I derive a statistical model that corresponds with the 
theoretical model. First, I derive a likelihood function 
for multinomial choice: 


Im 


Lo | es or 
Im] (8) 


log L x D log ry, 


im] pol 


where the dependent variable is vote choice, such that 
y, = lifthe ith voter votes for party j G =1,2,...,m), 
and 0 otherwise, and yr, is the probability of individual 
1 (i = 1,2, ..., n) voting for party j. This probability is 
a function of his or her utility for that party and his or 


D Cooperative game theory offers combinatorial power indices, such 
as the Shapley—Shubik Power Index and Banzhaf Power Index. How- 
ever, in the two majoritaman cases where a single party holds the 
majority of the seats, these indices assign the majority party an ab- 
solute power and al] other parties no power. This 1s contrary to the 
model definiton, where s, <1, and therefore a companson across 
institutions using the indices ıs impossible. 

14 I also conducted the analysis utilizing a smple average. The gen- 
eral direction of the results holds, although the results are weaker. 
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TABLE 1. Issue Voting In Four Political Systems (0 = Compensational, 1 = Representational) 


Measure Norway" The Netherlands Great Britaln 
Seat share 0.674 0.543 0.833 
(0.564, 0.767)° (0.512, 0.572) (0.761, 0.892) 
Vote share 645 0.543° 0.730 
(0.512, 0.572) 
59 


(0.608, 0.833) 
849 


Avg. (seats, portfollos) 


“Results for Norway including the Liberal party are similar to the results reported here. For example, § based on vote share is 0.605 
with confidence Interval (0.537, 0.676). 

595% confidence Interval. Uncertainty 1s calculated by randomly drawing from the multrvanate normal distribution centered at Å with 
vanance equal to 6% (see Herron 2000 and King, Tomz, and Wittenberg 2000). General results of the estimation and model specification 
for each system are presented In Appendix II 

°Seat-share and vote share In the Netherlands are identical (with the excepton of fine-tuned rounding). Therefore, the resutts for the 
Netherlands are actually based on two, rather than three, measures. 


her utility for all other parties, such that for each voter 


ae m, =1, 


: f (uy) 
n(vote, =j) =r, = -i <, (9) 
7 VE f We) 
where I employ a logistic error structure such that 


f (a) = exp(a). 

Second, the systematic component of the statistical 
model is in parallel with the theoretical model as it 
appears in Equation (4); it contains representational 
and compensational motivations weighted by # and 
1 — £, respectively, as well as the salience parameter 
(0), and m — 1 vectors of party-specific effects (6,) of 
the background variables, z, (6, for j = 1 is set to 0 for 
identification purposes), such that each ô, is a vector 
including effect coefficients for each variable i in Z,, as 
well as a party-specific constant.» This, in combination 
with Equations (8) and (9), produces a Conditional 
Logit Model: 


= -$ - representational, — (1 — P) 
- compensational,] + 6, z 
= {—B(M — py)" — (1 - BI — PY’ 
—(v, -Pph + 8,2. (10) 


Finally, I employ both quadratic and city-block utilities, 
different model specifications, and different measures 
of the dependent variable. 

The optimization is unconstrained, so in order to 
compare the relative effects of proximity vs. compen- 
sational voting across countries I parameterize f using 
logistic transformation.'® The normalization of £ allows 
me to evaluate whether the data support the theory or 
disprove it. Estimates of 6 that are close to the upper 


15 The estimated vectors of background-variable coeffiaents (includ- 
ing party-specific intercepts) vary across polities depending on the 
relevant cleavages in each particular political system as established 
in previous research. The models are presented in Appendix I. 

16 In the theoretical model, I defined 6; + 62 = 1, so the two pa- 
rameters are reduced to one, £ € (0, 1). In parallel, in the empirical 
model I reparameterize £ using a logistic transformation such that 
B= (1+exp(—a))~!. This parameterization ensures that £ will be 
bounded between 0 and 1 





bound (high f’s with large standard errors) or estimates 
of $ in systems with a high level of power-sharing that 
are as large as or larger than £ in systems with Little 
power-sharing will lead me to infer that the data do not 
support the institutional hypothesis. On the other hand, 
estimates of £ that are systematically smaller in power- 
sharing systems than in majoritarian systems will lead 
me to infer that the data support the theory.!” 


RESULTS: INSTTTUTIONALLY 
DEPENDENT VOTERS 


To test my theory, I first estimate Equations (8) through 
(10) in each of the four polities. While the issue com- 
ponent of the model! is identical across the four, the 
background variables vary across systems depending 
on the relevant political cleavages established in pre- 
vious research. Model specification and comprehen- 
sive results of the estimations are presented in Appen- 
dix IL!8 My main quantity of interest, and the focus of 
the discussion below, however, is the extent to which 
voting is proximity-driven or compensational, as cap- 
tured by the parameter estimate 8. 

Table 1 focuses on the estimated £ in each of the 
four polities. Each column presents the results for one 
of the four polities, and each row indicates the measure 
used. How do the results vary by electoral system? In 
almost all cases, vote in the consensual systems is more 
compensational than vote in the majoritarian systems. 
Voting in Britain, for example, follows the proximity 
model more closely than voting in the Netherlands 
across all measures (0.833, 0.730, and 0.849 in the 


17 To estimate Equations (8) riire oi I multiply through and 
rearrange terms. I rewrite the brack 


A-—PPRX, — (1 — B)CMP, | = X—BPRX, — CMP, + CMP, ) 
= AP(CMPy — PRX,) — CMP j). 


I then multiply through and estimate the model in the form 
GX CMP, — PRX,) —@-CMP,, which allows separate identification 
of B and 8. In combination with the description in footnote 16, 
I maximize the likelihood function with respect to a, 6, and the 
vector 6. 

18 Results of all coefficients estimated in the 12 sets of estimation 
(four cases times three measures of party impact) can be obtained 
from the author. 
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former and 0.543, 0.543, and 0.597 in the latter), and 
the same pattern evolves from a comparison of the 
Netherlands with Canada (0.770, 0.613, and 0.880 in the 
latter). The results for Norway are somewhat weaker 
yet still in the expected direction. The results produced 
by utilizing the seat-share measure indicate that voting 
behavior in Norway is more compensational than in 
Britain, and, similarly, results produced by the mixed 
measure indicate that voting in Norway is more com- 
pensational than in Canada. The rest of the results 
for Norway do not allow me to generally infer that 
voter behavior in Norway is different from that in 
Canada and Britain (the confidence intervals over- 
lap substantially). Overall, however, with one excep- 
tion (vote-share measure in Norway and Canada), all 
confidence intervals in the power-sharing systems are 
centered around smaller fs than in the majoritarian 
cases. In other words, the pushing and pulling, bargain- 
ing, and compromise—all consequences of institutional 
features—are reflected in voter behavior.!? 

In addition, I conduct a set of robustness tests for 
the empirical predictions of the model. These tests in- 
clude variations on the specification reported above. 
In particular, I repeat the test for each of the three 
measures of party impact varying: (a) the utility func- 
tion from quadratic to city block, (b) the dependent 
variable to thermometer rating for parties, (c) the 
measure of party position to average perceived posi- 
tion, and (d) the addition of retrospective voting and 
candidate evaluation as further covariates. I also con- 
duct the tests with combinations of these changes: (e) 
city block—thermometer ratings, and (f) thermometer 
rating—average perceived party placement. Overall, 
the original results and these tests provide 21 differ- 
ent estimates of 6 for each polity (with the excep- 
tion of the Netherlands, where they provide 14 such 
estimates). 

In all specifications except c the point estimate of £ 
in either of the majoritarian cases is greater than the 
estimate in either of the proportional cases on at least 
two of the measures. In the cases of city-block utilities, 
thermometer ratings for parties, and the combination 
of the two, the results hold for all three measures, and 
in the cases of the original model, additional covari- 
ates, and the combination of party thermometers and 
average perceived party position, they hold for two of 
the three measures and for three of the four polities 
in the third measure, All these results are in the ex- 
pected direction, but while most of these differences 
are statistically significant, in some cases they statisti- 
cally overlap. 

The results of specification c show no systematic 
pattern. Examination of these results in combination 
with specification f (where the results are in the right 
direction but are not statistically significant for two of 
the three measures) suggests that the measure of party 


19 These results generally hold employing city-block rather than 
quadratic functions for measures of distance. Employing the seat- 
share measure, for example, the estimation produces Bweheriesdés = 
0.53 with a 95% confidence interval (0.51, 0.55), Brorway = 0 62 (0.57, 
0.68), Barman = 0.94 (0 70, 1), and Canade = 0.70 (0.55, 0 82). 
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FIGURE 2. 


Issue Voting In Four Systems 


& 





placement as average perceived placement by voters 
is the reason for the weak results. As discussed above, 
and as Blais et al. (2001), and Westholm (1997) argue, 
given that the focus of this study is individual voter’s 
choice, the appropriate measure of party position is the 
position as it is perceived by the individual voter, rather 
than an average. 

To complement these figures, Figure 2 presents den- 


` sity plots of the four estimates of $ based on the seat- 


share measure.” As the figure shows, voting in the 
Netherlands and Norway, the two consensual systems, 
is more compensational than voting in the two majori- 
tarian systems, Britain and Canada. The variance of 
ß in all four cases is relatively small (even if differ- 
ent), such that with the exception of a partial overlap 
between Norway and Canada, the proportional-repre- 
sentation cases are distinguishable from the majoritar- 
ian cases. 

In comparing the fs across the models I assume that 
the estimates of £ in the different countries are inde- 
pendent of one another conditioning on the covariates 
in each model. This implies, for example, that the taste 
of Dutch voters for compensational/representational 
strategy in the Netherlands is independent of the taste 
of Canadian voters for these strategies. Indeed, it is 
hard to imagine why there would be dependence in 
taste for vote-choice strategy among voters in differ- 
ent polities, conditioning on systematic factors of voter 
choice. Relying on this assumption, and in addition 


2 The estimation procedure itself is described above. Postestimation 
random draws are taken from the sample distribution, such that the 
distribution of BNwtheriands 18 centered at 0.543, etc. 


American Political Science Review 


Measure Norway 


Vol. 99, No. 2 


TABLE 2. Likellhood-Ratio Test Agalnst the Restricted Model of B=1 (Proximity Model) 
The Netherlands Great Britain Canada 
Ee = Se. o — VAa. 


Seat share 
Vote share 


Avg. (seats, portfolios) 


2 p value m 


è Seat share and vote share in the Netherlands are identical (with the exception of fine-tuned rounding). 


to the analyses above, I compute the differences in £ 
between each majoritarian case and each proportional 
case (not reported). The results are consistent with 
the results presented in Table 1. All the point esti- 
mates of the differences are positive, indicating that 
f in the majoritarian systems is greater than £ in the 
proportional systems. The Netherlands is consistently 
distinguishable from both Britain and Canada. Norway 
is not always statistically distinguishable from the ma- 
joritarian systems, although the direction of the results 
is as expected. 

Does the Proximity Model sufficiently capture the 
information in the data? To further address this issue, 
I compare the results above to a restricted model con- 
sisting of the proximity component and the background 
covariates”. Table 2 presents Likelihood-Ratio tests 
for the two models in the four systems based on each 
of the three measures. As the table shows, the restricted 
model (f = 1) is rejected in all 12 cases. 

That $ is removed from the upper bound not only 
in the power-sharing systems but also in the majori- 
tarian systems is particularly interesting. Although 
Britain and Canada are extreme cases of the majori- 
tarian models, they are not ideal types of the model 
(see Lijphart 1999, 10-21, for detailed discussion of 
Britain, and Figure 14.1 there for placement of the 
four cases on the Executive-Parties dimension). Even 
in First-Past-the-Post systems, where opposition par- 
ties do not participate directly in the policy-formation 
process, they indirectly affect policy outcomes by shap- 
ing the political discourse and forcing other parties to 
adapt their policies (Avakumovic 1978; Meguid 2002). 
By the same token, the results presented in Table 1 
show that 2 in the proportional cases is safely away 
from the lower bound—issue voting is not entirel 
compensational even in the proportional cases. This 
is consistent with both the theoretical expectation and 
the nature of the empirical cases; similar to Canada 
and Britain on the majoritarian end, both Norway 
and the Netherlands are strong, yet not pure, cases 
of consensual democracy. Finally, a comparison of 
the cases from the 1980s (Britain and Norway) with 
those from the 1990s (Canada and the Netherlands) 
does not reveal a systematic pattern with respect to 
timing. 


7! The Compensational Vote Model reduces to the Proximity Model 
when f =1. 





These results shed light on the institutional com- 
ponent of the argument and on the extent to which 
the Proximity Model captures voter behavior. Voters 
in the power-sharing systems behave differently from 
voters in the majoritarian systems. Under the former, 
voters compensate for the watering-down of their vote, 
often voting instrumentally for a party whose posi- 
tions differ from their own. Under the latter, votes 
are hardly diluted, and voters make their choice accor- 


dingly. 


DISCUSSION AND EXTENSIONS 


How do voters’ positions on issues translate into vote 
choice? The common wisdom of issue voting holds 
that voters make their choice based on their evalua- 
tions of party positions (platforms). Since, according 
to this framework, platforms are the object of voter 
evaluation, any given theory of voter choice based 
on platform evaluation should account for voter be- 
havior irrespective of the way votes are converted to 
policy. 

I demonstrated that this is not the case. Regular- 
ities of voter behavior vary by institutional context. 
In particular, voter choice reflects variation in post- 
election bargaining. This finding suggests that some- 
thing else is at work, namely, outcome-oriented be- 
havior. My framework is not only consistent with the 
unaccounted-for variation observed in current studies, 
it also allows me to reinterpret (rather than declare a 
winner in) the current debate between the two lead- 
ing theories of issue voting, proximity and directional 
voting. 

In itself, the notion that voters are concerned with 
policy outcomes is not a novel insight (Austen-Smith 
and Banks 1988; Baron and Diermeier 2001). However, 
my framework allows me to explain how variation in 
institutional mechanisms, types of democracy, and even 
individual-level characteristics account for variation in 
strategies voters employ. This article has focused on the 
first component; I have shown that outcome-oriented 
motivation leads voters in parliamentary regimes to 
endorse parties whose positions differ from the voters’ 
own positions depending on the institutional environ- 
ment in which voters make their choice. 

As formalized in this article, the theory is captured 
by a single parameter. But as mentioned above, £ 
stands for a complex reality. One can estimate, for 
example, the extent to which each voter’s motivation 
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is compensational or representational, the extent to 
which voters are compensational or representational 
on different policy dimensions, the extent to which 
supporters of different parties are motivated by com- 
pensational or representational considerations, and the 
like. 

In addition, the argument travels across types of 
democracy. In fact, it applies to any political system 
where no single player can implement his or her pre- 
ferred policy in its pure form. While the bargaining 
units in parliamentary systems are parties (or subsets 
of members of the parties) in the parliament, in pres- 
idential systems the power is distributed between the 
executive and the legislature, and in federal systems 
bargaining takes place between regional and federal 
agents. 

Indeed, scholars of presidential systems have ex- 
amined policy balancing by voters in both general 
and midterm elections (e.g., Alesina and Rosenthal 
1995 and Mebane 2000 for analysis of the American 
case). As some of these studies argue, depending on 
the voter’s position, the more extreme and/or more 
powerful the president is, the more extreme and/or 
powerful the voter prefers Congress to be in order 
to balance the president. But policy balancing in the 
United States is a specific case of policy balancing 
by voters in general—voters incorporate postelectoral 
bargaining into their choice: their vote in legislative 
elections depends on their prediction of (or in Midterm 
elections, knowledge of) partisan control of the exe- 
cutive. 

While in presidential systems voters balance the 
legislature vis-a-vis the executive, in federal systems 
the branches balanced are regional and federal gov- 
ernments. Examining electoral returns from almost 
three decades in Germany (Lohman, Brady, and Rivers 
1997), and adding analysis from the postunification 
era (Gaines and Crombez 2004, Kedar 2004), schol- 
ars have demonstrated that the party in control of the 
chancellery systematically loses seats in subsequent 
state elections. While partisanship and the economy 
explain part of the gap, controlling for differential in 
turnout between elections at the two levels, these stud- 
ies demonstrate that voters use Land elections to coun- 
terbalance the federal government and, thus, engage in 
vertical policy balancing. 

Whether parliamentary, federal, or presidential 
forms of power-sharing, the conceptual difference be- 
tween the analysis of voter choice in this study and 
the classic formulations of voter choice has to do with 
the transformation from the set of possible parties 
(platforms) to the set of possible policy outcomes. This 
study shows that while voters choose a party (a plat- 
form) from a set of competing parties, they are often 
concerned with policy outcomes. The complexity of 
policy formation and the mapping from vote choice 
to policies often result in voter preference for a party 
that is not necessarily the most ideologically proximate 
to the voter. Unpacking the fashion in which institu- 
tional mechanisms convert votes to policy outcomes 
is a crucial step toward better understanding of voter 
choice. 
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APPENDIX |: ANALYTIC SOLUTION 
Second-order condition on U,,4 with respect to p 4: 


a7 U, 
—* = 6[-26 — 2(1 — pisil. (AJ) 
OP’, 
Rewriting Equation (A1), it is easy to see that this is a maxi- 
mum: 
3U, 
— = 204 (2 — 1) — 268. 
3p 
Since 0 < £ < 1and 9 > 0, both expressions are negative, and 
therefore 3 Ua/3p? < 0. Differentiating U, with respect to 
Pa, we get 


dU, 


ee = 201 — B)ss(¥, — PASA — P B5B — PcSc) 
PB 

261 ~~ B)Sp (v — +e pec) 
i AAD) 
S53 + Sc 


Setting the result to 0 and solving for pz: 


(1 — B)sa(¥; — sapa — sæ c) — (1 — B)sp 
x {v, — [sap c/ss + Sc]})/(SB + 5c) 


—(1 — Bs + [(1 — B)sp/(s8 + 8c] 
Second-order condition with respect to pp: 


soa ng ta] 


(sp + $c)* 


pa== (A3) 


3 Us E 
ap, 





(A4) 


Rewriting Equation (A4), it is easy to see that this is a 


Ua 1 
3p = 201 - Oe (Go -1). 


Since all elements are positive and 0 <sg+sc <1, #Uy4/ 
3p > 0. These results are symmetric with respect to pc. 


(A5) 


APPENDIX II: COMPREHENSIVE RESULTS 


Table A1 presents results for the power-sharing cases—the 
Netherlands and Norway—and Table A2 presents results for 
the majoritarian cases—Britain and Canada. Each model 
introduces an estimate for compensational-representational 
voting, salience, and party-specific effects of the background 
covariates (as in Equation [10]). Coefficients of the covariates 
represent effects of the respective variables on the log odds of 
voting for the relevant party compared to the reference party 
(coefficients of the reference party are normalized to 0). 
Model specification for each polity depends on the relevant 
political cleavages established in previous research. Model 
specification for Canada is based on Clarke, Kornberg, and 
Wearing (2000) and Johnston et al. (1992). The results in- 
clude all regions but Quebec, which I estimate separately. 
Results for Quebec (not reported here) are similar to the 
general results presented here. Model specification for the 
Netherlands is based on Dorussen and Taylor (2001), Quinn, 
Martin, and Whitford (1999), and Taylor and Dorussen 
(2000). The model for Bntain is based on Alvarez, Nagler, 
and Bowler (2000). I also draw on Whitten and Palmer (1996) 
for model specification in the Bntish and the Dutch cases. 
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TABLE A2. Votlng Behavior In Two Majorttarian Systems 


Representational/ 
Compensational 
(8) 


Britain (1987) 
0.833 (0.033)* 
0.143 (0.007) 
Cons /Lib. Labour/Llb. 


Salience (6) 


South 
Midlands 
North 

Wales 
Scotland 
Unlon member 
Public sector 


—0.378 (0.210) —0 473 (0.214) 

0.186 (0.215) 0.275 (0.205) 
—0.136 (0.362) 0.748 (0.302) 
—0.553 (0.261) 0.125 (0.247) 
—0.771 (0.132) 0.211 (0.102) 

0.081 (0.117) 0.126 (0.132) 
Blue-collar 0.208 (0.114) 
Woman 0.211 (0.080) 0.071 (0 099) 
Age —0.001 (0.004) —0.014 (0.005) 
Home own 0.458 (0 156). —0 500 (0.111) 
Family Income 0.102 (0.027) —0.081 (0.029) 
Education —0.145 (0.242) —0.050 (0.278) 
Constant 0.427 (0.565) 0.996 (0.668) 


log IIkellhood = 1,023.02, N = 1716 


—0.512 (0.214) —0.863 (0.207) Age 
Education 
Income 
Woman 
East 
Ontarto 
Prales 
0.951 (0.130) BC 
Catholic 
Mamed 
Union member 
Unemployed 
Constant 


Canada (1997) 
0.770 (0.047) 
0.132 (0.014) 


RCA/L PC/L NDP/L 


—0.003 (0.008) 0.016 (0.009) 0.007 (0.009) 
—0.097 (0.090) 0.190 (0.088) 0.006 (0.096) 
—0.106 (0.108) —0.082 (0.117) —0.089 (0.117) 
0.157 (0.251) 0.450(0.270) 1.124 (0.288) 
—0.384 (0.532) 0.251 (0.275) 0.739 (0.182) 
—0.859 (0.430) —0.531 (0.263) —0.623 (0.224) 
0.858 (0.399) —0.122 (0.262) —0.013 (0.247) 
0.527 (0.422) —1.751 (0.389) —0.401 (0.250) 
—0.538 (0.228) —1.040 (0.242) —1.209 (0.299) 
—0.105 (0.140) 0.502 (0.136) —0.237 (0.147) 
0.138 (0222) 0.314(0.215) 1.000 (0.217) 
0.239 (0.680) 0.783 (0.770) 1.988 (0.631) 
1.451 (0.807) —1.977 (0.563) —0.711 (0.851) 


lIkellhood = 393.77, N = 429 


Note Standard errors In parentheses. For Brian Cons, Conservatves, Lib, Liberal Alliance For Canada: L, Liberal Party of Canada, 
NDP, Canada's New Democrate Party, PC, Progressive Conservatrve Party; RCA, Canadian Reform Conservatve Alllance Results do 
not Include Quebec. Estimation for Quebec, as well as for the whole country, produced similar results. 

“This coefficrent (as in Table 1) 1s calculated via logistic parametertzaton (as explamed in footnote 16). Standard errors are calculated 
via simulations (1,000 random draws) from the multtvartate normal distibution, with the vector of estimated coeficsents as the mean 


and estimated vanance—covanancs matrix as varlance. 
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characterized by simultaneous binary decisions. Applications include: voting participation games, 


T paper characterizes behavior with “noisy” decision making for models of political interaction 


candidate entry, the volunteers dilemma, and collective action problems with a contribution 
threshold. A simple graphical device is used to derive comparative statics and other theoretical properties 
of a “quantal response” equilibrium, and the resulting predictions are compared with Nash equilibria 
that arise in the limiting case of no noise. Many anomalous data patterns ın laboratory experiments based 


on these games can be explained in this manner. 


any political and social decisions involve only 

two options: to vote or not, to enter a con- 

test or not, to join an alliance or not, etc. The 
apparent simplicity of these binary-choice situations is 
somewhat misleading in that the best decision requires 
correct beliefs about others’ behavior. For instance, 
people may hesitate to go to a particular restaurant or 
bar when many others are likely to go, or as Yogi Berra 
said, “Nobody goes there anymore. It’s too crowded.” 
In other cases, the rewards associated with each deci- 
sion may be contingent on getting a minimal number 
of decisions of a certain type, e.g., the choice by a coun- 
try of whether or not to join an alliance, or jump into 
world war, or impose an embargo. A similar example 
occurs when a majority vote is needed to approve a 
legislative pay raise that each legislator would prefer 
not to support if it would pass otherwise (Ordeshook 
1986, chap 3). Sometimes the minimal number of con- 
tributors needed is only one, as in the “volunteer’s 
dilemma,” where all players are better off if at least one 
of them incurs a cost from vetoing an option, attempt- 
ing a dangerous rescue, or volunteering to perform a 
task that benefits them all (Diekmann 1985). 

In all these examples, the question is whether inde- 
pendent choices made by different people will some- 
how generate the “correct” amount of participation or 
whether the inability to coordinate will lead to defi- 
ciencies such as excess entry or crowding, insufficient 
effort to produce a public good, or the failure of anyone 
to initiate an action that benefits all. Another interest- 
ing issue is how aggregate behavior patterns respond 
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1 Entry decisions in this context are known as the “El Farol” 
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to changes in the number of people involved and the 
relevant costs and benefits of participation. 

This paper is motivated in part by the surprising and 
sometimes anomalous behavior patterns observed in 
many laboratory experiments that involve simple bi- 
nary choices. For example, Kahneman (1988) reports 
an experiment in which the number of people who de- 
cide to enter was approximately equal to a “capacity” 
parameter that determined whether or not entry was 
profitable. He remark, “To a psychologist, it looks like 
magic.” Subsequent experiments have been based on 
similar models, and the general finding is that play- 
ers are able to coordinate entry decisions in a man- 
ner that roughly equates expected profits for entry to 
the opportunity cost (Ochs 1990; Sundali, Rapoport, 
and Seale 1995). However, the “magic” of efficient 
entry coordination has been called into question by 
recent experimental results. For example, Fischbacher 
and Thöni (2001) conducted an experment in which 
a monetary prize is awarded to a randomly selected 
entrant, so the expected prize amount is a decreasing 
function of the number of entrants. Over-entry was 
observed, and it was more severe for larger numbers of 
potential entrants. This over-entry pattern is somewhat 
intuitive but contradicts the theoretical prediction that 
rewards should be equalized for the two options, inde- 
pendent of the number of potential entrants. Camerer 
and Lovallo (1999) also find over-entry when posten- 
try payoffs depend on a skill-based competition, but 
they report under-entry and positive net payoffs in the 
absence of such competition. 

Other interesting behavior patterns have been 
observed in experiments based on the volunteer’s 
dilemma, in which everyone receives a benefit if at 
least one person incurs the cost of “volunteering” but 
each person would prefer to free-ride on others’ ef- 
forts. The theoretical prediction is that an increase in 
the number of potential volunteers will reduce the 
probability that any one person volunteers, which is 
intuitive, and will decrease the probability that at least 
one person volunteers, which is unintuitive. Experi- 
mental data support the intuitive prediction but not the 
unintuitive one (Franzen 1995). Similarly, laboratory 


2 This successful coordination has been explained by models of adap- 


tation and learning (Erev and Rapoport 1998; Meyer et al. 1992) 
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results for binary coordination games and collective 
action problems support some theoretical predictions, 
but also generate intuitive data patterns that are not 
explained by standard game theory, as discussed below. 

The objective of this paper is to explore the common 
structural elements of a wide class of binary-choice 
games, and to provide a unified theoretical perspective 
on seemingly contradictory results, like the positive re- 
lationship between over-entry (or the probability of 
getting a volunteer) and the number of potential en- 
trants (or volunteers). Our approach involves relax- 
ing the extreme rational choice assumption of perfect 
maximizing behavior where people respond sharply to 
small payoff differences, which, in reality, are likely 
to be dwarfed by an array of emotions, perception 
biases, and unobserved individual differences in fair- 
mindedness, altruism, etc. Instead of trying to model all 
these dimensions explicitly, our approach is to replace 
the knife-edge responses to small payoff differences 
with “smoothed” stochastic responses that represent 
random variations in unobserved factors (Goeree and 
Holt 1999; McKelvey and Palfrey 1995; Palfrey and 
Rosenthal 1985, 1988). The broader value of this work 
is that it provides an enriched and empirically useful 
game theory that applies to the kinds of situations of 
concern to political scientists, i.e., those with a rich 
diversity of individual motivations and attitudes. In 
addition, we derive our results using a simple graphical 
device that can be used in a wide variety of seemingly 
unrelated binary-choice situations. 


TO PARTICIPATE OR NOT? 


A symmetric N-person participation game is charac- 
terized by two decisions, which we call participate and 
exit. Examples include the decision of whether or 
not to run for office, try to unseat an incumbent, or 
approach a wealthy donor seeking campaign contribu- 
tions. The payoff from participation is a function of 
the total number, n, who decide to participate, which 
is denoted x(n), defined for n < N. In a campaign entry 
game, for example, the payoff for all candidates may be 
a decreasing function of the number, n, who enter. The 
expected payoff for the exit decision is denoted c(n), 
which is typically nondecreasing in n (the number of 
players that enter). In many applications, c(n) is simply 
a constant that can be thought of as the opportunity 
cost of participation, but we keep the more general 
notation to include examples where a higher number 
of participants has external benefits to all, including 
those who do not participate (e.g., campaigning for 
civil rights; see Chong 1991). 

A strategy in this game is a participation probability, 
p €[0,1]. In order to characterize a symmetric equi- 
librium, consider one player’s decision when all others 
participate with probability p. Since a player’s own pay- 
off is a function of the number who actually participate, 


3 The participation game terminology was introduced by Palfrey and 
Rosenthal (1983) in the context of the decasion of whether or not 
to vote This model is discussed below, under Voting Participation 
Games. 
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the expected payoff for participation is a function of the 
number of other players, N — 1, and the probability p 
that any one of them will participate. Assuming in- 
dependence, the distribution of the number of other 
participants is binomial, with parameters N — 1 and 
p. This distribution, together with the underlying x(n) 
function, can be used to calculate the expected partici- 
pation payoff, which is denoted n°(p, N — 1). More 
precisely, 7°(p, N — 1) is defined to be the ex 
payoff if a player participates (with probability 1) when 
all N — 1 others participate with probability p. Simi- 
larly, c°(p, N — 1) is the expected payoff from exit 
when the N — 1 others participate with probability p. 


Equlllbrium 


In a Nash equilibrium, players choose the decision that 
yields the highest expected payoff, or randomize in the 
case of indifference. Our goal is the explanation of 
“anomalous” data from laboratory experiments, so it 
is convenient to model a type of noisy behavior that 
includes the rational-choice Nash predictions as a limit 
case. One way to relax the assumption of noise-free, 
perfectly rational behavior is to specify a utility func- 
tion with a stochastic component. For example, people 
may be motivated to vote by a sense of citizen duty 
(Riker and Ordeshook 1968), the strength of which 
may vary across individuals and across time for the 
same individual as external factors change. Thus the 
expected payoff for participation, n°, and the expected 
payoff for exit, c°, are each augmented by adding the 
stochastic term ue, where u > 0 is an “error” param- 
eter and the s, represent identically and independently 
distributed realizations of a random variable for deci- 
sion i = 1 (participate) or 2 (exit). The utility of partic- 
ipation is greater if n° + ws; > c° + uen, so that when 
u = 0 the decision with the highest expected payoff 
is selected, but higher values of u imply more noise 
relative to payoff maximization. This noise can be due 
either to errors (e.g., distractions, perception biases, 
or miscalculations that lead to nonoptimal decisions) 
or to unobserved utility shocks that make rational be- 
havior look noisy to an outside observer. Regardless 
of the source, the result is that choice is stochastic, 
and the distribution of the random variable determines 
the form of the choice probabilities. The participation 
decision is selected if n° + ws; > c* + pws OF, equiv- 
alently, if 62 — 8&1 < (n° — @)/u, which occurs with 
probability 


p| Ee Does, (1) 


where F is the distribution function of the difference 
€2 — €;. Since the two random errors are identically 
distributed, the distribution of their difference will 
be “symmetric” around 0, so F(0) = 1/2.° The error 


4 For stance, a normal distribution yields the probit model, while a 
double exponential distribution gives rise to the logit model, ın which 
case the choice probabilities are proportional exponential functions 
of expected payoffs. 

5 More formally, Pr(s; < £2) = 1/2, so Pr(s; — s2 <0) = F (0) =1/2 
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parameter, 4, determines the responsiveness of partic- 
ipation probabilities to expected payoffs. Perfectly ran- 
dom behavior (i.e., p = 1/2) results as u —> ov, since 
the argument of the F{- ) function on the right side of 
Eq. (1) goes to zero and F(0) = 1/2 as noted above. 
Perfect rationality results in the limit as u — 0, since 
the choice probability converges to zero or one, de- 
pending on whether the expected participation payoff 
is less than or greater than the expected exit payoff. 
More generally, for y > 0, Eq. (1) expresses the par- 
ticipation probability as a noisy best response to the 
expected-payoff-difference, which we also refer to as a 
“stochastic best response.” 

Equation (1) characterizes a quantal response equi- 
librium (McKelvey and Palfrey 1995) if the participa- 
tion probability p in the expected payoff expressions on 
the right is equal to the choice probability that emerges 
on the left. Without further parametric assumptions, 
there is no closed-form solution for the equilibrium 
participation probability, but a simple graphical device 
can be used to derive theoretical properties and char- 
acterize factors that might cause systematic deviations 
from Nash predictions. The graph is based on a separa- 
tion of the expected-payoff-difference from a term that 
depends only on the noise elements (yz and the distribu- 
tion of random elements). To this end, apply the inverse 
of the F function to both sides of (1) and multiply by 4 
to obtain uF—-'(p) = x°(p, N — 1) — c°(p, N—1). The 
determination of the equilibrium participation proba- 
bility is illustrated in Figure 1. As p goes from zero 
to one on the horizontal axis, zF-1(p) increases from 
—oo to +, as shown by the thick curved line with a 
positive slope in the figure.’ Since the expected pay- 
off difference is continuous in p, it has to cross the 
uF-'(p) line at least once, which ensures the existence 
of a symmetric equilibrium.’ If the expected payoff 
difference r°(p, N — 1) — c°(p, N — 1) is decreasing in 
p, the intersection will be unique. This case is illustrated 
in Figure 1, where the negatively sloped thin line at the 
left side of the figure represents the expected payoff dif- 
ference. This line intersects the “inverse distribution” 
line at the equilibrium probability labeled QRE on the 
left. Also, note that the point where the expected pay- 
off difference crosses the zero-payoff line constitutes 
a mixed-strategy Nash equilibrium, since players are 
only willing to randomize if expected payoffs for the 


6 The quantal response equilibrium, developed by political scientists 
(McKelvey and Palfrey 1995), has been applied to the study of in- 
ternational conflict by Signorino (1999). A general introduction to 
the usefulness of the quantal response approach m the analysis of 
pouucal data can be found in Morton 1999. 

To see this, note that an expected-payoff-difference of —oœ on the 

vertical axis will cause the participation probability to be zero, and 
an expected payoff difference of +-co will cause the participation 
probability to be one. This is why the thick “inverse distribution” 
line starts at —oo on the left side of Figure 1 and goes to +oo on the 
right. 
8 The existence of quantal response equilibria for normal-form 
games with a finite number of strategies is proved in McKelvey and 
Palfrey 1995, and that for normal-form games with a continuous 
strategy space in Anderson, Goeree, and Holt 2002. 
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two decisions are equal. This crossing point is labeled 
“NE Mix” in the figure.’ 

Next consider the intuition for why the quantal re- 
sponse equilibrium is not typically at the intersection 
of the expected-payoff-difference line and the zero- 
payoff horizontal line in Figure 1. With equal expected 
payoffs for participation and exit, the person is indiffer- 
ent, and since F(0) = 1/2, the stochastic best response 
to such indifference is to participate with probability 
one-half In the figure, this result can be seen by start- 
ing where expected payoffs are equal at the NE Mix 
point on the left and moving horizontally to the right, 
crossing the thick line at p = 1/2. This is not a quantal 
response equilibrium since the p we started with (at the 
NE Mix) is not the stochastic best response to itself. To 
find a stochastic best response to any given entry prob- 
ability p on the horizontal axis, first move in the ver- 
tical direction to find the associated expected-payoff- 
difference and then move horizontally (left or right) to 
the thick line, which determines the stochastic best re- 
sponse to that expected-payoff-difference. Equilibrium 
requires that the stochastic best response to the others’ 
participation probability is that same probability, which 
occurs only at the intersection of the expected-payoff- 
difference and inverse distribution lines in Figure 1. To 
summarize, a symmetric quantal response probability is 
a stochastic best response to itself, whereas a symmet- 
ric Nash equilibrium probability is a best response to 
itself. 


? All the games considered m this paper are symmetric m the sense 
that players’ payoff functions are identical We only consider sym- 
metric mixed-strategy Nash equilibria for such games. It ıs sometimes 
possible to find asymmetric Nash equilibria for symmetric games, but 
without some coordination device these equilibna seem less plau- 
sible. 

10 At the “NE Mix” point in Figure 1, expected payoffs are equal 
and any probability is a best response, so the NE Mir probability 1s 
a best response to itself. 


Anomalous Behavior in Models of Political Participation 


As long as the expected payoff difference is decreas- 
ing in p, it is apparent from Figure 1 that any factor 
that increases the expected payoff difference line for 
all values of p will move the intersection with the thick 
inverse distribution line to the right and, hence, raise 
the quantal response equilibrium probability. In an 
entry game, for example, the original x(n) function 
would be decreasing if expected rewards are decreasing 
in the number of entrants, and it is then straightfor- 
ward to show that n°(N — 1, p) is a decreasing func- 
tion of both arguments. When the opportunity cost 
payoff from not entering is constant, it follows that 
the expected-payoff-difference m°(p,N—1)—c*(p, 
N — 1) is decreasing in p and N, so a reduction in the 
number of potential entrants will shift the thin line 
in the figure upward and raise the quantal response 
(QRE) probability, as represented by a comparison of 
the high-N case on the left with the low-N case on the 


t. 

The effect of additional “noise” in this modelis easily 
represented, since an increase in the error parameter yu 
makes the zF~!(p) line steeper around the midpoint, 
p = 1/2, although it still passes through the zero-payoff 
line at this midpoint (see Figure 1). This increase in 
noise, therefore, moves the quantal response equilib- 
rium closer to one-half, as would be expected. In con- 
trast, as a reduction in » makes the zF~*(p) line flatter, 
and in the limit it converges to the horizontal line at 
zero as the noise vanishes. In this case, the crossings 
for the QRE and mixed Nash equilibria match up, as 
would be expected. 

Next, consider coordination-type games where par- 
ticipation can be interpreted as an individual decision 
of whether or not to help with a group production 
process that will only succeed if enough people help 
out. For example, participation in revolutionary activi- 
ties may be individually costly unless the movement 
reaches a critical mass. In such games, it does not pay 
to participate unless enough others do, so x(n) will 
be less than c(n) for low n and greater than c(n) for 
high n. Thus the right side of Eq. (1) is increasing in 
the probability of participation. This property may re- 
sult in multiple quantal response equilibria since there 
can be multiple intersections when both the expected- 
payoff-difference and the inverse distribution lines are 
increasing in p (see Figure 2, which shows a case with 


11 Inturtrvely, holding N fixed, a higher probability of entering means 
that more people enter, which results in a lower expected payoff 
of entry Simularty, holding p fixed, a higher number of potential 
entrants results in more entry. This can be made more precise as 
follows. suppose N is fixed and the entry probability is pı Let the 
number of entrants be determined by drawing a random number that 
is uniformly distnbuted on [0, 1] for each player. If the number is less 
than pı, a player enters; otherwise the player stays out. When the 
probability of entering increases to p2 > p1, the number of entrants 
1s at least the same as before for all possible realizations of the random 
variables, and greater for some realizations. (When a player’s random 
variable is less than p4, ıt 1s certainly less than p2, leading to the 
same entry decision, and when it lies between p; and p2, the player’s 
decision changes from staying out to entering.) Likewise, when p is 
fixed, an increase in the potential number of entrants means that 
for all possible realizations of players’ random draws, the number 
of entrants is the same or higher, which makes the expected payoff 
from entry the same or lower. 
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three intersections). With multiple crossings, any fac- 
tor that shifts the expected-payoff-difference line up- 
ward will move some intersection points to the left 
and others to the right. Thus the comparative statics 
effects are of opposite signs at adjacent equilibria, and 
we need to use an analysis of dynamic adjustment 
to restrict consideration to equilibria that are stable 
(the Samuelsonian “correspondence principle”).” A 
simple dynamit model can be based on the intuitive 
idea that the participation probability will increase 
over time when the “noisy best onse” to a given 
p is higher than p. Thus dp /dt > 0 when F((x°(p, N — 
1) —c°(p, N—1))/h) >p, or equivalently, p would 
tend to increase when n°(p,N—1)—c*(p,N-1)> 
uF-'(p) and decrease otherwise. For example, start 
at p = .6 in Figure 2, which gives a positive expected- 
payoff-difference and a stochastic best response of al- 
most .9, found by moving horizontally to the right. For 
this reason, a rightward arrow is present at p = .6 on 
the horizontal axis. The other directional arrows are 
found similarly, so there is an unstable QRE at about 
3, with arrows pointing away. In this manner it can be 
seen that the quantal response equilibrium will be sta- 
ble whenever the expected-payoff-difference line cuts 
the inverse distribution line from above. 

Note that any factor that raises the payoff from 
participation, and hence shifts the expected-payoff- 
difference line upward in Figure 2, will raise the QRE 
participation probability if the equilibrium is stable and 
not otherwise. To summarize: 


Proposition 1. There is at least one symmetric quan- 
tal response equilibrium in a symmetric binary-choice 
participation game. The equilibrium ts unique if the dif- 
ference between the expected payoff of participating and 


12 Similar dynamuc-stability arguments were used by Palfrey and 
Rosenthal (1988) and Fey (1997). 
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that of exiting is decreasing in the probability of partici- 
pation. In this case, any exogenous factor that increases 
the participation payoff or lowers the exit payoff will 
raise the equilibrium participation probability. The same 
comparative statics result holds when there are multiple 
equilibria and attention is restricted to stable equilibria. 


It is useful to begin with a discussion of entry games 
since they are the simplest application. Moreover, the 
quantal response properties for these games also ap- 
ply to the stable equilibria in more complex applica- 
tions such as threshold contribution games, volunteer’s 
dilemma, and voting. The reader who is primarily in- 
terested in one of these subsequent applications may 
wish to skip any of the later sections after reading as 
far as Proposition 2. 


ENTRY GAMES: UNDER-ENTRY AND 
OVER-ENTRY RELATIVE TO MIXED 
NASH PREDICTIONS 


A widely studied example that fits the binary-choice 
framework is an “entry” game, where the choice is 
between a risky entry decision with high potential pay- 
offs (if few others enter) and a secure exit payoff. For 
example, entry may correspond to launching a political 
campaign or filing an application for a limited num- 
ber of public broadcast licenses. There are N potential 
entrants, and we assume that if all others enter with 
probability one, the representative player would pre- 
fer to exit due to congestion, but if nobody else enters, 
then the player would prefer to enter: n°(1, N — 1) < 
c°(1, N — 1) and x*(0, N — 1) > c°(0, N — 1). Consider 
a simple three-person congestion problem where each 
person’s payoff from participation is one unless both 
others also participate, in which case congestion re- 
duces the payoff to zero. The exit payoff is c, with 
0 < c < 1. When both others participate with probabil- 
ity p, the probability of congestion is p?,son® = 1 — p?, 
which is less than the exit payoff c when p = 1 and 
greater than the exit payoff when p = 0. In this ex- 
ample and in all other applications considered below, 
the expected-payoff-difference will be continuous and 
decreasing in p, so there is a unique p* for which 


m°(p*,N—1)=c*(p*, N—1). (2) 


[For instance, in the three-person congestion prob- 
lem p* = (1—c)!*.] Since (2) implies indifference, 
it characterizes the unique symmetric Nash equilib- 
rium in mixed strategies. The net payoff for participa- 
tion, 7°(p, N — 1) — c°(p, N — 1), is decreasing in p, as 
shown by the “expected-payoff-difference” line on the 
left in Figure 1. As noted above, the crossing of this 
thin line and the horizontal line at zero represents the 
solution to Eq. (2) and is labeled “NE Mix” on the left 
side of the figure. 

In order to compare the Nash and quantal response 
equilibria, note that the thin lines representing the 
differences in expected payoffs are always negatively 
sloped in an entry game. First, consider the high-N 
case on the left, where the large number of potential 
entrants lowers the expected payoff associated with a 
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given participation probability, and the resulting mixed 
equilibrium is less than one-half. The intersection of the 
negatively sloped thin line and the increasing inverse 
distribution line determines the quantal response par- 
ticipation probability, and this intersection will be to 
the right of the mixed Nash probability. The opposite 
occurs for the low-N case on the right side of the graph, 
where the low number of potential entrants results in a 
mixed equilibrium that is greater than one-half. In this 
low-N case, the QRE probability is biased downward 
from the Nash probability. One way to understand both 
cases is to note that the effect of adding noise is to push 
the equilibrium toward one-half. 

Finally, recall that the thin lines in Figure 1 repre- 
sent the expected-payoff-difference on the right side of 
Eq. (1). At the QRE probability on the left, net ex- 
pected payoffs are negative and there is over-entry in 
this case of a high number of potential entrants. In 
contrast, the thin line lies above the zero line at the 
QRE probability on the right side, for the low-N case. 
This negative relationship between the number of po- 
tential entrants and the net returns from participation is 
consistent with the experimental results of Fischbacher 
and Thöni (2001) discussed in the Introduction.!4 To 
summarize: 


Proposition 2. Inthe quantal response equilibrium for 
the entry game, there is over-entry resulting in negative 
net expected payoffs when the mixed-strategy Nash equi- 
librium is less than one-half. The reverse effect, under- 
entry, occurs when the mixed Nash equilibrium is greater 
than one-half. 


The implication of Proposition 2 is that the quantal 
response equilibrium for the entry game is always be- 
tween the Nash equilibrium and one-half. Therefore, 
an observed participation that is more extreme than 
the Nash prediction would contradict the quantal re- 
sponse equilibrium model for any error rate, u, and 
any distribution of stochastic shocks, F. 

Meyer et al. (1992) report an experiment in which 
subjects choose to enter one of two markets. With a 
group size of six, profits are equalized, with three in 
each market, so the equilibrium probability of entry 
is one-half’ An immediate corollary to Proposition 2 
is that in this case QRE coincides with Nash and both 
predict an entry probability of one-half. This prediction 
is borne out by their data: the average of the number 
of people that enter each market is never statistically 
different from three in the 11 baseline sessions that 


13 In some games with strong strategic interactions, the “snowball” 
effects of small amounts of noise can push decasions away from the 
unique Nash equilibrium so strongly that they overshoot the mid- 
point of the strategy space, with most of the theoretical density at 
the opposite end of the set of feasible decisions from the Nash pre- 
diction. This 1s the case for some parameterzations of the “traveler's 
dilemma” (Capra et al 1999). This prediction, that the data will be 
clustered on the oppostte side of the midpoint decision from the 
Nash equilibrium, is borne out by the experimental evidence. 

‘4 Tn their game, a prize worth V is awarded randomly to one of the 
n players who purchase a lottery ticket at cost c, so a(n) = (V/n) — c. 
From this it can be shown that the expected-payoff-difference is 
decreasing ın p and N. 
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they report (see their Table 3), even when the game is 
repeated for as many as 60 periods (see their Table 5). 

Camerer and Lovallo (1999) provide support for the 
QRE under-entry prediction when the Nash probabil- 
ity of entry is greater than one-half In their experiment 
subjects decide whether or not to enter a contest with a 
fixed number, c, of prizes. The entrants were randomly 
ranked and the top c entrants divide $50 according 
to their rank, while all other entrants lose $10. The 
exit payoff is simply 0, and the equilibrium number of 
entrants is (close to) c + 5. The parameters were chosen 
such that the Nash entry probability was greater than 
or equal to one-half in all treatments.© Under-entry 
occurred in all of the eight sessions in their baseline 
treatment, which resulted in positive expected payoffs 
for entry (see their Table 4). The net expected payoff 
of entry across sessions and periods was $15, which 
translates into under-entry of one or two subjects per 
round." 

The strongest evidence for the quantal response 
predictions in Proposition 2 can be found in Sundali, 
Rapoport, and Seale 1995. In their experiments, sub- 
jects received a fixed payoff of one for exit and an 
entry payoff that is increasing in market capacity, 
c, and decreasing in the number of entrants: x(n) = 
1 +2(c — n). Thus entry in excess of capacity reduces 
payoffs below one, the payoff for exit. It is straight- 
forward to derive the mixed Nash entry probability: 
p* =(c—1)/(N-—1), which is approximately equal 
to the ratio of capacity to number of potential en- 
trants./8 The capacities for the various treatments were 
c=1,3,...,19, and with groups of N = 20 subjects, 
the Nash equilibrium probability ranged from p* = 0 
to p* = 18/19. Figure 3 shows the entry decisions av- 
eraged over all subjects, with the Nash predictions 


15 Meyer et al. (1992) also report some evidence that does not square 
with either the symmetric Nash or the quantal response predictions 
of our model. In particular, the frequency with which subjects switch 
markets ıs lower than the predicted frequency (50%). We conjecture 
that this “inertia” could be explained by an asymmetric quantal re- 
sponse equihbrium m which some people tend to enter with higher 
propabuity than others 

6 The number of prizes was either 2, 4, 6, or 8, yielding equilibrium 
numbers of entrants (c + 5) of 7, 9, 11, or 13 respectively, which are 
always greater than or equal to half the group size (14-16). 
17 Camerer and Lovallo (1992) also report a second treatment in 
which subjects are told beforehand that their performance on sports 
or current events trivia will determine their payoff This creates a 
selection bias, since people that participate in the experiment are 
more likely to think they will rank high when they enter (ie, they 
are “overconfident”), neglecting the fact that other participants think 
the same (“reference group neglect”). Camerer and Lovallo propose 
overconfidence and reference group neglect as a possible explanation 
of the over-entry that occurs in this second treatment. This explana- 
tion is quite plausible, ın that ıt 1s analogous to the failure to perceive 
a selection bias that causes winners ın a common-value auction to 
be the ones who overestimated its value. Note that overconfidence 
cannot be the whole story, however, since this bias does not explain 
under-entry m their baseline treatment. 

18 To derive this symmetnc mixed equilibrrum, note that the ex- 
pected number of other people who enter 1s (N — 1)p, so if a per- 
son enters, the expected total number of entrants is 1 + (N — 1)p. 
Then x(n) can be used to calculate the expected payoff for entering: 
x°(p,N—1) =1+2(e—1) — (N—1)2p and the Nash equiibnum 
probability of entering follows by equating this expected payoff to 
the ent payoff of one, which yields the result in the text. 
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FIGURE 3. Nash Predictions (Solld Line) and 
Observed Entry Probabilities (Dlamonds 
(Source: Sundall, Rapoport, and Seale 1995) 


data averages 
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shown as the 45° line. Since each subject participated 
in 10 “runs” and there were three groups of 20 sub- 
jects, a data point in the figure is the average of 
10 * 3 x 20 = 600 entry decisions. Note that the entry 
frequency is generally higher than predicted by Nash 
for p* < 1/2 and lower than predicted for p* > 1/2, in 
line with the quantal response equilibrium predictions. 

To summarize, the quantal response analysis ex- 
plains the “magical” conformity to Nash entry predic- 
tions (e.g., Meyer et al. 1992), the under-entry in the 
Camerer and Lovallo 1999 baseline, the over-entry with 
many potential entrants observed by Fischbacher and 
Thöni (2001), and the systematic pattern of deviations 
from Nash predictions reported by Sundali, Rapoport, 
and Seale (1995). This general approach can be adapted 
to evaluate behavior in other contexts where payoffs 
for one decision are diminished as a result of congestion 
effects, as the next section illustrates.” 


THE VOLUNTEER’S DILEMMA 


There are many situations in which a player’s decision 
to participate benefits others. In collective action prob- 
lems, for instance, the contributions of some have pos- 
itive returns for everyone involved, and these returns 
are increasing in the number of contributors. In some 
contexts, the critical number of participants is one, e.g., 


19 The analysis presented here does not apply directly to the ex- 
permments reported ın Ochs 1990, since his expenments involved 
more than two market locations, each with a different “capacity” 
that determined the number of entrants that could be accommodated 
profitably. Nevertheless, the data patterns with random regrouping 
(“high turnover”) are suggestive of the quantal response results de- 
rived here The locations with the most capacity (and high probabil- 
ities) consistently have a lower frequency of entry than required for 
a mixed-strategy Nash equilibrium, whereas the opposite tendency 
was observed for locations with the capacity to accommodate only 
one entrant profitably. 
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TABLE 1. Frequencies of Individual 
Volunteer Decisions (p) and of “No 
Volunteer” Outcomes 

N p P(No Volunteer) 
12 

07 

.06 

13 


.65 
08 
43 
25 
elo) 
30 
-20 
„35 





when a volunteer is needed to issue a politically risky 
veto or sanction a group member who violated a norm. 
The dilemma in these situations is that volunteering 
is costly and players have an incentive to free ride on 
others’ benevolence., 

In the volunteers dilemma game studied here 
(Diekmann 1986), all players receive a benefit B if at 
least one of them incurs a cost, C < B. In this case, 
the expected payoff of participation, or “volunteer- 
ing,” is simply a constant, B — C. The expected pay- 
off from “exiting” follows from the observation that 
when the N —1 others volunteer with probability p, 
there is a (1 — p)"—! chance that no one volunteers, so 
(p, N —1) = B(1 — (1 — p¥™+). Note that the vol- 
unteer’s dilemma game satisfies the assumptions under- 
lying Figure 1, i.e., the difference between the expected 
payoff of participating and that of exiting is decreasing 
in p. The Nash probability of volunteering follows by 
equating these expected payoffs (as per Eq. [2]) to 


obtain 
P C nt 
p =1- ($) (3) 


This probability of volunteering has the intuitive prop- 
erties that it is increasing in the benefit, B, decreas- 
ing in the cost, C, and decreasing in the number of 
potential volunteers, N. However, the probability of 
getting no volunteers is (1 — p*)™. By Eq. (3) the 
probability of getting no volunteers in a Nash equi- 
librium is (C/BY/A-)), which is increasing in N, with 
lim y+coP(No Volunteer)=C/B > 0. Unlike the in- 
tuitive comparative statics properties mentioned be- 
fore, this prediction is not supported by experimen- 
tal data. Table 1 reports experimental results for a 
one-shot volunteer’s dilemma game with B = 100 and 
C = 50 (Franzen 1995). Note that the probability that 
any person volunteers is generally declining with N, 
as predicted by Nash.” The probability that no one 
volunteers, however, is decreasing in N and converges 
to zero instead of C/B = 1/2. 

Next, consider the quantal response equilibrium for 
the volunteer’s dilemma. Since the difference between 
the expected payoff of volunteering that of and exiting 


2 Franzen (1995) reports that the group-sre effect 1s significant at 
the 5% level using a chi-square test with seven degrees of freedom. 
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is decreasing in the probability of volunteering, Propo- 
sition 1 implies that the QRE probability of volunteer- 
ing is unique, decreasing in N and C, and increasing 
in B. Interestingly, the introduction of (enough) en- 
dogenous noise reverses the unintuitive Nash predic- 
tion that the probability of “no volunteer” increases 
with N. 


Proposition 3. In the quantal response equilibrium 
for the volunteers dilemma game, the probability that 
no one will volunteer is decreasing in the number of 
potential volunteers for a sufficiently high error rate, 
u. Furthermore, limy..P(No Volunteer) =0 for any 
u >Q. 


The proof of Proposition 3 is given in the Appendix. 
The intuition is that, in the presence of noise, the ad- 
dition of potential volunteers only results in a small 
reduction in the probability of volunteering, and the 
net effect is that the chance that someone volunteers 
will rise.”2 

The unintuitive feature of the Nash equilibrium for 
the volunteer’s dilemma (i.e., that the probability of 
getting no volunteer increases with N) parallels the 
result that the chance of convicting an innocent de- 
fendant under the unanimity rule (i.e., no acquittal 
votes) rises with the size of the jury (Feddersen and 
Pesendorfer 1998). The models differ in that jurors 
receive private signals about the likelihood that the 
defendant is guilty. In the Nash equilibrium, those that 
receive a guilty signal vote to convict while those with 
an innocent signal randomize between voting to convict 
or to acquit. As the jury size increases, an individual 
juror’s propensity to vote to acquit with an innocent 
signal falls, and the chance that there is not a single 
vote to acquit rises. As a result, it becomes more likely 
that an innocent defendant is wrongfully convicted 
(Feddersen and Pesendorfer 1998). In laboratory jury 
voting experiments, subjects tend to vote strategically 
as predicted by the Nash equilibrium. However, the 
unintuitive numbers effect is not supported by experi- 
mental data and is not implied by a quantal response 
equilibrium analysis (Guarnaschelli, McKelvey, and 
Palfrey 2000). 


GAMES WITH MULTIPLE EQUILIBRIA: 
STEP-LEVEL PUBLIC GOODS GAMES 


In some binary-choice games the expected payoff func- 
tion for participating is not decreasing in p. For exam- 
ple, in any collective political activity where a critical 
mass is required to achieve a desired outcome (e.g., 
regime change), the net reward from participating will 
be higher as others become more involved.~ There- 
fore, the payoff difference function is increasing in 


2i In the extreme case when u — oo, players volunteer with prob- 
ability one-half, irrespective of the number of potential volunteers, 
and the chance that no one volunteers falls exponentially, smce the 
probability of no volunteer s 2—", 

In the discussion that follows we treat the threshold as a sharp 
cutoff even though it is more reasonable m most contexts to model 
the threshold as a range of participation over which the probability of 
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the probability of participation, which permits mul- 
tiple crossings as shown in Figure 2. This is intuitive, 
since there may exist both low-participation equilib- 
ria and high-participation equilibria in such “coordi- 
nation” or “assurance” games” A particular example 
is a step-level public goods game, where N players de- 
cide whether or not to “contribute” at cost c. If the 
total number of contributions meets or exceeds some 
threshold n*, then the public good is provided and all 
players receive a fixed return, V, whether nor not they 
contributed. Here we assume that the contribution is 
like an effort that is lost if the threshold is not met, 
so there is “no rebate.” The threshold n* could corres- 
pond to a required number of participants in an em- 
bargo or signatures on a petition.* 

In the standard linear public goods games without a 
step, observed contributions in experiments are posi- 
tively related to the marginal effect of a contribution on 
the value of the public good, known as the marginal per 
capita return (MPCR).” Anderson, Goeree, and Holt 
(1998) have shown that a logit quantal response anal- 
ysis predicts this widely observed MPCR effect. This 
raises the question whether there is a similar measure 
or index that would predict the level of contributions 
in step-level public goods games. One would intuitively 
expect that contributions are positively related to the 
total (social) value of the public good (NV) and nega- 
tively related to the minimum total cost of providing it 
(n*c). Croson and Marks (2000) have proposed using 
the ratio of social value to cost, which they call the “step 
return:” SR = NV/n*c. Based on a meta-analysis of 
several step-level public goods games, they conclude, 
« .. Subjects respond to the step return just as they 
correspond to the marginal per capita return (MPCR) 
in linear public goods games: higher step returns lead 
to more contributions.” 

First, we consider whether there is a clear theoret- 
ical basis for expecting contributions to be positively 
related to step return measures. A contribution in this 
game pays off only when it is pivotal, i.e., when exactly 
n* — 1 others contribute, which happens with probabi- 
lity 


(> e 3 


where, as before, p denotes the probability that others 
participate. The difference between the expected 
payoff of contributing and that of not contributing is 


oe a a a 
success 1s sharply increasing. The use of a sharp cutoff simplifies the 
analysis and is standard in the literature (see, e.g, Lohmann 1994) 
B Stability arguments can often be used to rule out the middle 
equilibrium if there are three crossings as in Figure 2. For low yu, 
this middle equilibrium is usually close to a mixed Nash equilibrium 
with “perverse” comparative statics properties. The high- and low- 
participation equilibna then correspond to low-effort and high-effort 
pure-suialeay Nash equilibria that often arise in coordination games 

Gilligan (2003) considers the problem of determining the “cor- 
rect” number of countries needed to ratify a treaty. A higher thresh- 
old mdicating broader support typically requires a less restrictive 

ent. 

This literature 1s surveyed in Ostrom’s (1998) presidential address 

to the Amencan Political Saence Association and in Miller 1997. 
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FIGURE 4. - tt-Differences and 
the Inverse Distribution Line for Different 


Thresholds In Step-Level Public Goods Games 
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The right side is a single-peaked function of p, and 
equating its derivative to zero yields a unique max- 
imum at p = (n* — 1)/(N — 1). Figure 4, drawn for 
V=6, c=1, and N = 10, shows these “hillshaped” 
expected-payoff-difference lines for three values of the 
threshold: n* = 3, 5, 8. (Please ignore the “n* = {5, 8}” 
line, which pertains to a multiple-step case considered 
later.) In each case there are two Nash equilibria in 
mixed strategies, determined by the crossings of the 
thin line with the horizontal line at zero. The inverse 
distribution line is plotted for the case of a logistic 
distribution, i.e., F(x) = 1/(1 + exp(—x)), and w =1. 
As before, the intersection of the inverse distribution 
line with the thin lines determines the quantal response 
equilibrium, which is unique for all three values of the 
threshold in this numerical example. 

Recall that the step return is NV/n*c, which is in- 
creasing in N and V and decreasing in n* and c. In 
order to evaluate these properties in the context of 
the quantal response predictions, note that the bell- 
shaped nature of the expected payoff differences im- 
plies that there may be multiple quantal response equi- 
libria. It follows from Proposition 1, however, that any 
factor that shifts the expected payoff difference line 
upward will raise the equilibrium probability in a stable 


26 More generally, when the expected-payoff-difference line is m- 
creasing there may be multiple equilibria for some values of the 
error rate u. For instance, a slight upward shift m the “n* = 8” line 
in Figure 4 would result in three quantal response equilibria. The 
stability analysis associated with Figure 2 can be used to show that 
the middle equilibrrum ıs unstable; see also Fey 1997 and Palfrey 
and Rosenthal 1988. The likelthood of having multiple equilibria 
is increased when yp is small and the uF—!(p) line is essentially 
honzontal for p between zero and one. 
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equilibrium. Since the difference in Eq. (5) is increasing 
in V and decreasing in c, we conclude that the equi- 
librium contribution probability will be increasing in 
V and decreasing in c, just as indicated by the step 
return effect. Next, consider the effect of the numbers 
variables, N and n*, beginning with a somewhat infor- 
mal graphical analysis (precise results are presented 
in Proposition 4, below). Recall that the maximum of 
the expected-payoff-difference “hill” is at a probability 
of (n* — 1)/(N — 1), so an increase in N tends to shift 
this function to the left. Note that a leftward shift in 
the thin line labeled n* = 3 in Figure 4 will lower the 
equilibrium probability, but a slight leftward shift in the 
line labeled n* = 8 will move the intersection point up 
along the thick line and, hence, will raise the quantal 
response equilibrium probability. Thus an increase in N 
can result in a decrease in the equilibrium probability 
when the threshold is low and an increase when the 
threshold is high.”’ The effects of changes in the thresh- 
old, n*, are similar. Note that the quantal response 
probability of contributing does not decrease mono- 
tonically with the threshold: when n* increases from 3 
to 5, the equilibrium probability increases from .43 to 
.56, and then drops to .27 when n* = 8. The intuition is 
that when the threshold rises and it is still likely that the 
public good will be provided, individual contributions 
will rise, but contributions drop dramatically when too 
many contributions are needed for provision. To sum- 
marize, in a quantal response equilibrium, a higher step 
return ratio leads to more contributions when it is due 
to a higher total value of the public good or a lower 
cost of provision, but not necessarily when it is due to 
an increase in the number of potential contributors or 
to a lower threshold. Thus the (admittedly theoretical) 
analysis here yields only qualified support for the use of 
the step return as a rough measure of the propensity to 
contribute in a binary step-level public goods game.” 

Of course, even when individual contributions rise 
in response to the increased threshold, the probability 
that the public good is actually provided may decrease, 
since more people are needed to meet the threshold. 
For the numeric example represented in Figure 4, the 
probability of success drops from .83 to .62 to practi- 
cally zero when n* is increased from 3 to 5 to 8. van 
de Kragt, Orbell, and Dawes (1983) report an experi- 
ment that implemented a step-level public goods game 
with binary contributions and found that increasing the 
number of contributors needed for success reduced the 
incidence of successful provision. The next proposition 
shows that these findings are in line with QRE predic- 
tions when there is sufficient noise. 


27 See, for instance, Offerman, Schram, and Sonnemans 1997 for 
expermmental evidence on some of these comparative static results 
78 Nor are the numbers effects in a Nash equilibrium necessarily 
consistent with the qualitative properties of the step return ratio. 
This is because an increase in the threshold n* shifts the maxmmum 
of the expected-payoff-difference line to the nght in Figure 4, which 
18 likely to shift the rightmost (stable) mixed Nash equilibrium to the 
right. Thus a mise in n*, which lowers the step return, can raise the 
mixed Nash contribution probability. 


FIGURE 5. QRE Probabliltiles of Individual 
Contributlon and Successful Group Provision 
of a Step-Level Public Good, as a Function 

of the Provision Polnt 
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Proposition 4. For a high enough error rate, u, the 
quantal response equilibrium for the step-level public 
goods game is unique and predicts that individual con- 
tributions first rise and then fall with the threshold, n*, 
while the probability of successful provision always de- 


creases with n*. 


This proposition, which is proved in the Appendix, 
is illustrated in Figure 5, which was drawn for the case 
where V = 6,c = 1, N = 10, u = 1.5, and with the pro- 
vision point, n*, varying from 1 to 9. A movement to 
the right in the figure corresponds to an increase in the 
number of contributors needed for successful provi- 
sion, which reduces the probability of success in a quan- 
tal response equilibrium. As the step level is increased, 
individual contributions first increase to meet the 
challenge and then fall as the threshold becomes more 
unattainable. Interestingly, Palfrey and Rosenthal 
(1988) derive this result in an equivalent manner by 
introducing random, individual-specific “joy of giving” 
(or “warm-glow” altruism) shocks that are added to a 
person’s payoff for a contribution decision.” Proposi- 
tion 4 extends their analysis by showing that the prob- 
ability of successful provision is decreasing in n*. 

Finally, it is interesting to see how contribution be- 
havior changes as multiple steps, or thresholds, are in- 
troduced. Suppose, for instance, that in addition to the 
n* = 5 threshold, there is another threshold at n* = 8: 
with five or more contributions, everyone receives a 
return of one from the public good, while with eight 
contributions or more, the return is two. This multiple- 
step case can be analyzed in the same manner as before. 


79 The Nash equilibrium for the resultmg game of mcomplete infor- 
mation is mathematically equivalent to a quantal response equihb- 
num. Palfrey and Rosenthal (1988) prove that individual contribu- 
tions first rse and then fall with the threshold (see thear Table 2). 
They also show that the number of potential contributors, N, has the 
reverse effect: indrvidual contributions first fall and then rise with 
increases in N 
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Now there are two points at which one’s contribution 
can be pivotal, and the expected payoffis the sum of the 
two effects. In terms of Figure 4, the expected payoff 
lines for n* = 5 and n* = 8 get “summed,” as indicated 
by the n* = {5, 8} line in Figure 4 (the cost of con- 
tributing only enters once, which is why the endpoints 
of this are still at —1). The introduction of the extra 
threshold at n* = 8, which by itself results in a low 
contribution probability, dramatically increases contri- 
butions: the QRE contribution probability is .73 and 
the probability that at least five people contribute is as 
high as .97. An immediate extension of this analysis 
is that adding more steps, without reducing the payoff 
increment at any of the existing steps, will increase 
quantal response contribution probabilities in a binary 
public goods game. 


VOTING PARTICIPATION GAMES 


Another binary choice of considerable interest is the 
decision whether or not to vote in a small-group sit- 
uation where voting is costly and a single vote has 
a nonnegligible effect on the final outcome, e.g., the 
decision whether to attend a faculty meeting on a busy 
day. The analysis is similar to that of a step-level pub- 
lic goods game, since the threshold contribution, n*, 
corresponds to the number of votes needed to pass a 
favored bill. In a real voting contest, however, the vote 
total required to win is endogenously determined by 
the number of people voting against the bill. If there 
are two types of voters, those who favor a bill and 
those who oppose, then the equilibrium will be char- 
acterized by a participation probability for each type. 
Here we restrict attention to a symmetric model with 
equal numbers of voters of each type, equal costs of 
voting, c, and symmetric valuations: V if the preferred 
outcome receives more votes and zero otherwise. Ties 
in this majority rule game are decided by the flip of 
a coin. Note that the public goods incentives to free- 
ride are still present ın this game, since voters benefit 
when their side wins, regardless of whether or not they 
incurred the cost of voting. 

The analysis of the majority voting game is a straight- 
forward application of the approach taken in the pre- 
vious sections. The gain from a favorable outcome is V, 
so the expected-payoff-difference is V times the prob- 
ability that one’s vote affects the outcome minus the 
cost of voting. (Obviously, the net cost of voting could 
be small or even negative if voting is psychologically 
rewarding or if there are social pressures to vote, e.g., 
to attend a faculty meeting.) Since a tie is decided by 
the flip of a coin, the probability that a vote is piv- 
otal is one-half times the probability that it creates or 
breaks a tie. In a symmetric equilibrium with common 
participation probability, p, it is straightforward to use 
the binomial formulas to calculate these probabilities, 
and the expected payoff difference for voting is then 
V/2 times this “influence probability” minus the cost of 
voting.» 


30 Suppose there are two groups of equal size, N, and consider a 
player m group 1 The player’s vote 1s pivotal only when the number 
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FIGURE 6. Nash and Quantal Response 
Voting Probabllittes Under Majority and 
Proportional Rules 
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Figure 6 shows the expected-payoff-difference as 
a function of the common participation probability, 
which is labeled “majority rule.” The parameters that 
were used to construct this figure are taken from 
Schram and Sonnemans (1996b), who conducted an ex- 
periment based on this game form with N = 6, V = 2.5, 
and c=1. The “U” shape of the expected-payoff- 
difference reflects the fact that a costly vote is wasted 
when the preferred outcome is already winning or 
when it cannot win even with an extra vote. Indeed, 
the expected value of a vote is highest when either no 
one else or everyone else votes, since a vote is then 
guaranteed to be pivotal by breaking or creating a tie. 
In contrast, when all others vote with probability 1/2, 
one extra vote is likely to be superfluous or not enough 
and its expected value is therefore small. As in previous 
sections, the mixed Nash prediction is determined by 
where the expected-payoff-difference line crosses the 
zero line: there are two Nash equilibria, one in which 
almost no one votes and another in which almost ev- 
eryone votes (Palfrey and Rosenthal 1983). 

The quantal response equilibrium is determined by 
the intersection of the expected-payoff-difference line 
and the inverse distribution function (thick lines).*! 


of voters in group 1 1s equal to m — 1 or m, where m denotes the 
number of voters in group 2, which happens with probability 


X e Ca ame, 


mml 
Dae O ai 


where, as before, p denotes the probability with which all others (in 
both groups) vote. The first term represents the probability that a tie 
is created and the second term is the probability that a tle is broken. 
A player’s expected payoff ıs V/2 times this “influence probability” 
mmus c, the cost of voting. 

31 Palfrey and Rosenthal (1985) use essentially the same techniques 
to determine the Bayesian—Nash equilibrium in a voting game with 
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The u parameter of .8 used to construct the steeper 
line was selected so that the QRE prediction would 
be at about the same level (30% to 50%) as the vote 
participation probabilities reported by Schram and 
Sonnemans (1996b) in the initial periods of their ex- 
periment. Interestingly, the voting probabilities started 
high and then decreased to stabilize somewhere in the 
20% to 30% range. This downward trend is crudely 
captured by a reduction in the noise parameter py to .4 
as indicated by the second inverse distribution line in 
Figure 6.7 

Schram and Sonnemans (1996b) also considered a 
“proportional rule” game in which the payoff for all 
participants is the proportion of votes for their pre- 
ferred outcome, minus the cost of voting if they voted. 
Again, it is straightforward to use the binomial for- 
mula to calculate the expected proportion of favorable 
votes, contingent on one’s own decision of whether 
to vote, as a function of the common participation 
probability, p.*? The expected payoff difference for 
this proportional representation game is the increase 
in the expected proportion of favorable votes, minus 
the cost of voting. This difference is declining every- 
where because one’s vote has a smaller impact on the 
vote proportion as the probability of others’ participa- 
tion increases. The expected-payoff-difference line is 
labeled “proportional rule” in Figure 6, where the pa- 
rameters are again taken from Schram and Sonnemans 
(1996b): N = 6, V = 2.22, and c = 0.7. The Schram and 
Sonnemans data for the proportional rule experiments, 
plotted as the lower line in Figure 7, started in the 30% 
to 40% range and ended up between 20% and 30% 
in the final periods. Note that participation is initially 
higher with the majority rule than with the proportional 
rule, while this difference disappears in the final peri- 
ods of the experiment when the voting probabilities 
are close to 25%, well above the Nash predictions for 
these games. This result is not surprising from a QRE 
point of view, since the two expected-payoff-difference 
lines cross at p = .25, at which they intersect with the 
inverse distribution line (for u = .4). The result, how- 
ever, is unexplainable by a Nash analysis for which 
the intersection of the two expected-payoff-difference 
lines plays no role and only “crossings at zero” mat- 


mcomplete information. In their paper, individual cost-of-voting 
shocks are added to each person’s payoffs The resulting Bayesian— 
Nash equilibrium is mathematically equivalent to a quantal response 
equilibrium. 
22 Alternatively, this downward adjustment could be explained by 
the u = .4 line, together with the dynamic stability argument under 
To Participate or Not? (above), which produces directional move- 
ments of the type indicated by the arrows on the homzontal axis in 
Figure 2. 
B Using the same notation as before, the expected payoff difference 
for a player in group 1 is 
N-1 N 
boo Gs | n+l Mm | 
nı mj n+tm+1 m+m 


A| mÛ m ml 





xp™t™(i -pj m-n- — e, 


where the outside sum pertains to the decisions of the N — 1 others 
of one’s own type, and the inside sum pertains to the N voters of the 
other type 
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FIGURE 7. Voting Participation Rates with 
Random wap (Source: Schram and 
Sonnemans 1996D) 
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ter. For the parameter values of the experiment, these 
crossings are at p = .05 and p = .95 for the majority 
rule treatment and at p = .09 for the proportional rule 
treatment, and seem to have little predictive power 
for the results of the Schram and Sonnemans 1996b 
experiment.” To summarize, both the qualitative data 
patterns and the magnitude of the observed voting 
probabilities are consistent with a QRE analysis (but 
not with Nash), as can be seen from Figures 6 and 7.*° 
This general approach may be extended to cover 
cases with asymmetries, e.g., when one type is more 
numerous than another. With asymmetries, the equi- 
librium will consist of a participation probability for 
each type. These two probabilities will be determined 
by two equations analogous to Eq. (1), with the ex- 
pected payoff for participation (voting) being a func- 
tion of the number of potential voters of each type 
and the equilibrium participation probabilities. While 
a simple graphical analysis of this asymmetric model 
is not possible, it is straightforward to proceed with 
numerical calculations, for example, to show that the 
smaller group is more likely to vote when the costs of 
voting are symmetric (Palfrey and Rosenthal 1983).* 


CONCLUSION 


Many strategic situations are characterized by binary 
decisions, e.g., whether or not to vote, volunteer, 


4 See also Schram and Sonnemans 1996a for a similar experiment 
with slightly different parameter values. 

* Note from Figure 6 that the participation probabilities predicted 
by the quantal response equilibrium are roughly the same under 
majonty rule and proportional rule. This similanty is due to the 
specific parameters used in the experiment and cannot be interpreted 
as a general empirical prediction. 

36 Cohen and Noll (1991) report that members of the majority coali- 
tion abstain more frequently in congressional roll call votes than 
members of the minority coalition. Cohen and Noll note that one 
cost of votmg is that of alienatrng some of the constituents who 


might disagree on the proper vote of their representative. 
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attend a congested event, or perform a costly task 
with social benefits. In this paper we present a simple 
model of equilibrium behavior that applies to a wide 
variety of seemingly unrelated binary-choice games, 
including coordination, public goods, entry, voting par- 
ticipation, and volunteer’s dilemma games. The model 
captures the feature that the decision, whether or not 
to participate, may be affected by randomness, either 
in preferences (e.g., entry or voting costs) or in deci- 
sion making (due to perception or calculation errors). 
The resulting quantal response equilibrium (McKelvey 
and Palfrey 1995) incorporates this randomness in 
the form of an error parameter and nests the stan- 
dard rational-choice Nash equilibrium as a limiting 
case. 

The quantal response equilibrium tracks many be- 
havioral deviations from Nash predictions, e.g., the 
tendency for entry to match the Nash predictions when 
the prediction is one-half and for excess entry when 
the Nash prediction is below one-half. In other words, 
a model with behavioral noise 1s capable of explaining 
the “magical” accuracy of Nash predictions in some 
experiments and the systematic deviations in others. 
The observed over-entry when Nash predictions are 
low is analogous to the over-participation in voting 
experiments, which is explained by a quantal response 
analysis. The participation rates in these experiments 
are roughly the same for the majority and proportional 
outcome rule treatments, which are consistent with the- 
oretical calculations for the parameters used in the ex- 
periments. Similarly, the quantal response model tracks 
intuitive “numbers effects” observed in volunteers’ 
dilemma and step-level public goods experiments, both 
when these effects are consistent with Nash predictions 
and when they are not. 

The quantal response equilibrium generalizes the 
standard Nash theory by allowing for stochastic ele- 
ments. The scale of these elements, as measured by the 
error rate u, determines how closely decisions match 
perfect-rationality predictions. Despite the unspecified 
nature of the stochastic elements, the quantal response 
equilibrium provides clear, falsifiable predictions for 
many of the binary-choice models considered in this 
paper. For example, the predicted participation prob- 
abilities for the entry games are less extreme than 
the Nash predictions (i.e., they lie between one-half 
and the mixed-strategy Nash equilibrium) for any er- 
ror distribution F. Similarly, the predicted volunteer 
rates for the volunteer’s dilemma are less extreme 
than the Nash predictions since the expected-payoff- 
difference is decreasing in the probability of volun- 
teering. In addition, there are key differences between 
Nash and quantal response equilibrium predictions 
such as the effect of large numbers on the probabil- 
ity of getting at least one volunteer or one vote to 
acquit under unanimity. Taken together, these results 
indicate that standard “rational-choice” game theory 
can be enriched in a manner that increases its behav- 
ioral relevance for a wide class of situations. More- 
over, the simple nature of the graphical equilibrium 
analysis will aid researchers in other binary-choice 
applications. 
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APPENDIX 


Proof of Proposttion 3 


The probability, P, that no one volunteers is given by 
(1 — p)™, where the QRE probability of volunteering, p, sat- 
isfies: 
F(p) = BAL —p)"* —C. (A1) 
Combining these equations and using the fact that F-'(p) is 
symmetric, .e., F-!(p) = —F-1(1 — p), allows one to express 
(A1) in terms of the probability that no one volunteers: 
uF(PÈ) =C- BP, (A2) 
from which the derivative of P with respect to N can be 
established as 


dP _ Plog(P) 


dN N 


(N — 1)Bf(F-1( PUN)) + Poa 





Note that dP/dN can only be nonnegative when u < 
P'~Y") Bf F-'(PY*)), The right side of this mequality is 
bounded by Bmax(f ), so dP/dN has to be negative for large 
enough u. Finally, suppose, in contradiction, that limy 
P > 0. This implies that P'/" tends to 1, so uF~1( PYY) —> oo 
when u > 0. This contradicts (A2) since the right side limits to 
C-BP, which is finite Hence, P tends to zero when N tends 
to infinity. In fact, from (A2) it follows that for large N, P 
converges to F(C/j)", which tends to zero since F(C/) < 1 
foru >0. QED. 


Proof of Proposition 4 
The QRE probability of contributing, p, satisfies 


uF (p) = VPR (p) -c 


where w > 1 denotes the threshold and P¥ (p) is the proba- 
bility that w — 1 of the N — 1 others contributed (see Eq. [6]). 
The solution to (A3) ıs unique when the derivative of 
the left side is everywhere greater than that of the nght 
side. The derivative of P*(p) with respect to p is given by 
((w —1)/p — (N — w)/(1 — p))P¥ (p) and the relevant con- 
dition for uniqueness 1s therefore 


u > VRF (p) (w — 1)/p —(N—w)/(L—p))P,, (A4) 


Note that the right side is negative when w = 1, and for w > 2 
it is less than V/{(F-!(p))PX(w — 1)/p. The latter expression 
can be rewritten (N — 1)VAF-!(p))P-}, which is bounded 
by (N — 1)V max(f ). So for u > (N — 1)V mar(f ), the quan- 
tal response probability of contributing is unique for all val- 
ues of the threshold. The derivative of PY (p) with respect to 
w < N (holding p fixed) is PY, (p) — PX(p), which simplifies 
to P¥(p)(1 — w/(N — w)(1 — p)/p). Together with (A3) this 
implies that the derivative of the QRE probability, p, with 
respect to the threshold, w, is given by 
dp 1-p 


dw N-—w 
VAP) (N= »)/(1 =p) = w/P) PS 
w+ VA(F-\(p))((N — w)/(1 — p) — ( —1)/p) PX” 
(AS) 


Note that the denominator of the second fraction on the 
right side is positive when the condition for a unique QRE 


(A3) 
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(Eq. [A4]) is satisfied. The sign of dp /dw is then determined 
by the numerator, which is positive iff p > w/N. The intu- 
ition for this result is straightforward: as long as the “inverse 
distribution” line intersects the “expected-payoff-difference” 
line to the right of its maximum (Le., p > w/N), an increase 
in w shifts the expected-payoff-difference to the right and 
moves the intersection point upward. The reverse happens 
for higher values of w when the inverse distribution line cuts 
the expected-payoff-difference line to the left of the maxi- 
mum (see also Palfrey and Rosenthal 1982). 

The probability, Q¥, that the public good is provided is 
given by 

S/N 
a=}, ( k)eta ap 
baw 

and its derivative with respect to w (for w < N) is 











dOS dQ} dp _ dp 1-p 
dw = Qoni — Qe Bae = NPY (E-F E) 


Combining (A5) and (A6) shows that Q™ is decreasing in w. 
QED. 
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American Communities 
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e test propositions about how different forms of civic engagement are related to democratic 

representation in American communities. Our data are for the samples of communities, their 

citizens, and their leaders originally examined by Verba and Nie in Participation in America 
(1972). Our analyses of those data indicate that membership in bridging social—capital civic associations 
is unrelated to democratic responsiveness of leaders to the mass public but that bonding sociaLcapital 
membership is negatively associated with such responsiveness. We also demonstrate that bonding social- 
capital civic engagement weakens the democratic linkage processes inherent in elections. 


oes more civic engagement—ass participation 
in private associations and organizations—lead 
to more responsive democratic governance? 
Considerable research has argued that to be the case, 
and something of a scholarly industry has arisen that 
assesses the character, causes, and consequences of 
such engagement in countries around the world (as 
recent examples, see Mettler 2002 and Pattie, Seyd, 
and Whiteley 2003). The value of civic engagement 
for the health of democracy has been proclaimed too, 
by civic reformers inside and outside the academy (as 
examples from a voluminous literature, see Kirlin and 
Kirlin 2002 and Swain 2001). Notable attention is also 
being devoted to how higher civic engagement might 
be fostered by the educational system (e.g., Dubnick 
2003). Yet we have no systematic evidence for the lit- 
eral representational benefits of civic engagement. 
Four scholarly works offer especially prominent ar- 
guments for the representational value of civic engage- 
ment. Fukuyama (1995, 3-32, 355-63), Oliver (2001, 
esp. 205), and Uslaner (2002, 218) propose distinctive, 
though related, conceptions of how civic engagement 
enhances civic values in the mass public and respon- 
siveness by the political elite and government. Doubt- 
less, however, the most systematic advocate for the 
influence of civic engagement on democratic respon- 
siveness is Robert Putnam. In Bowling Alone Putnam 
argues that higher levels of civic engagement have two 
important effects. They increase the level of social cap- 
ital, “social networks and the norms of reciprocity and 
trustworthiness that arise from them” (Putnam 2000, 
19) and thus, second, increase the degree to which the 
mass public communicates its preferences to political 
leaders and achieves successful representation of those 
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preferences in public policy (Putnam 2000, 338-49). 
In Making Democracy Work Putnam (1993, 89-91) 
also argues that civic engagement enhances democratic 
representation by facilitating interest aggregation and 
articulation. 

Two key expectations are common to these argu- 
ments. The first is that greater civic engagement is 
associated with better representation of mass prefer- 
ences in government policy. Second, the effect of civic 
engagement on government responsiveness is separate 
from those of conventional democratic participation in 
elections and voting. Indeed, Putnam (2000, 37) explic- 
itly observes that voting is not a form of social capital, 
thus social capital and conventional participation must 
influence government in distinctive ways. 

Verba and Nie’s (1972) seminal work on mass polit- 
ical behavior offers a little-appreciated elaboration of 
these causal expectations. Verba and Nie (1972, 47- 
48, 322-27) distinguish civic engagement (that they 
refer to as communal participation) as being a form 
of mass political behavior that delivers “information” 
about group preferences especially well to the political 
elite. In contrast, voting participation in their argument 
places high “pressure” on the political elite to respond 
to mass preferences but provides little explicit infor- 
mation on those preferences. Political leaders may be 
particularly influenced, further, by civic engagement 
and the information it provides. Verba and Nie (1972, 
323) even conclude that leaders may at times respond 
to the preferences of civic engagement activists to the 
neglect of general mass preferences. Thus Verba and 
Nie—like Putnam, Oliver, and Uslaner—expect civic 
engagement to have a direct causal effect on repre- 
sentational outcomes independent of the influence of 
other modes of participation such as voting, and their 
construction of the information and pressure inducing 
functions of different forms of participation explains 
how this can be the case. 

Verba and Nie (1972, 326-27) also propose a sec- 
ond path by which civic engagement might influence 
representation—by way of an interaction with the level 
of voting participation. They argue that voting partici- 
pation, which imposes pressure, interacts with civic en- 
gagement, which offers information on preferences, so 
that higher levels of both produce especially high and 
positive representational outcomes. They observe that 
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“the general pressure of the vote appears to enforce 
the effectiveness of the other, more specific modes of 
participation” (326). 

Putnam (2000, 22), however, elaborates the pre- 
ceding general expectations by conceptualizing dif- 
ferences between civic engagement in bonding social 
capital organizations and bridging social capital ones. 
Bonding social capital organizations, which are often 
based on kinship or whose membership tends to be 
class-exclusive, are “inward looking and tend to rein- 
force exclusive identities and homogeneous groups.” 
Putnam cites fraternal organizations as a good exam- 
ple. In contrast, bridging groups are “outward looking 
and encompass people across diverse social cleavages,” 
as would be the case with civil rights and ecumenical re- 
ligious organizations. Bridging organizations, further, 
particularly enhance “reciprocity,” the expectation that 
social exchanges in a community will be directed at mu- 
tual rather than individual benefit. These distinctions 
suggest that bridging-group civic engagement should 
engender positive representational benefits but that 
bonding-group engagement is harmful to the demo- 
cratic process. It is only bridging groups, and not bond- 
ing social capital ones, that generate the reciprocity 
that is central to Putnam’s expectations for the ben- 
efits of social capital. Bonding social capital groups, 
in contrast, promote exclusive interactions and values 
that encourage government responsiveness to the few 
instead of to the community at large. 

These expectations about the two kinds of civic 
engagement comport with other conceptual obser- 
vations. In Making Democracy Work (1993, 171-76) 
Putnam argues that social networks must encompass 
people of various social groups and statuses to engen- 
der reciprocity—as only bridging organizations do by 
definition. Fukuyama (1995, 23-32) and Uslaner (2002, 
26-38) make related arguments. Other scholars have 
also concluded that some kinds of civic engagement 
may lead to values that compromise reciprocity, trust, 
and democratic processes (e.g., Fiorina 1999, Hero 
2003, and Levi 1996, 47). 

Like virtually every student of associational involve- 
ment, Putnam (2000, 48-64) also postulates that active 
and involved civic association membership will have 
different consequences than nominal membership. Ac- 
tive membership is widely believed to confer more ben- 
efits to individual organizational members and to the 
efforts of the organization itself to achieve its goals. 
Thus higher levels of active and involved membership 
in bridging and in bonding social capital civic associa- 
tions can be expected to induce especially high positive 
and negative effects, respectively, on democratic rep- 
resentation. 

The preceding distinctions suggest that levels of 
bridging-group and bonding-group civic engagement 
should have different direct effects on democratic re- 
sponsiveness. But the second causal path that Verba 
and Nie suggest above implies that these two forms of 
civic engagement might also influence representation 
differently in interaction with the level of mass voting 
behavior. Higher levels of bridging-group civic engage- 
ment (whether nominal or active), first, might inter- 
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act with higher voting participation to produce espe- 
cially higher levels of responsiveness—because bridg- 
ing groups are expected to promote community-wide 
values that would be reinforced by the pressure of 
voting participation. In contrast, high bonding-group 
engagement might interact with voting participation 
to depress responsiveness. Bonding-group engagement 
promotes exclusive rather than general values, and high 
levels of information about such values being transmit- 
ted to the political elite would moderate the expected 
positive effect of voting.’ 

The preceding conceptual arguments about civic en- 
gagement have been very influential, but there is no 
compelling evidence for them. Only Berry, Portney, 
and Thomson (1993) provide explicit evidence on 
civic engagement and the representation of mass 
preferences—in analyses for five American cities. Yet 
the general implications of this research are unclear 
because the five cities were chosen because of high 
levels of civic engagement and the results cannot be 
generalized to the universe of communities that varies 
in levels of such engagement. 

The remaining empirical work on the policy con- 
sequences of civic engagement and social capital does 
not address representation per se (e.g., Knack 2002 and 
Putnam 1993, 99-162, 2000, 287-335). Such analyses 
assess objective levels of policy outputs, outcomes, or 
innovation but not how any aspect of policy literally re- 
lates to mass preferences. Further, such analyses—and 
even those by Putnam—do not distinguish bridging 
and bonding engagement and rarely take account of 
nominal versus active membership. In the large, sepa- 
rate body of research on mass participation generally, 
only the work of Verba and Nie (1972) systematically 
examines the representational consequences of civic 
engagement. Yet Verba and Nie do not take account 
of the conceptual distinctions about civic engagement 
outlined above because their research preceded the 
development of these conceptualizations. 

We examine the effects of active and nominal levels 
of bridging and bonding civic engagement in Amer- 
ican communities on mass—elite agreement on com- 
munity public policy priorities. Our tests are for the 
set of communities—and samples of their citizens and 
leaders—examined by Verba and Nie in Participation 
in America (1972). The parts of this data set that are 
critical to the present analysis—the data on mass—elite 
policy “concurrence,” on various community attributes, 
and on the communities of residence of the individual 
mass respondents—were not archived by Verba and 
Nie and were effectively lost. But Hill (2004) recon- 
structed and validated those data with help from par- 
ticipants in the original research project. This data set 
is perhaps uniquely valuable for studying the relation- 
ships of concern here too. Oliver’s (2001) and Putnam’s 


1 Verba and Nie (1972, 176-80) provide what we judge to be unique 
evidence on a critical assumption for this expectation, one on which 
some critics of Putnam have raed questions Even members of 
bonding-group civic associations with explicitly nonpolitical organi- 
zational fo reported notable levels of political discussion of com- 
munity affairs in these organizations. 
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(2000, esp. 336-40) conceptions of the roles of civic 
engagement for democratic governance are especially 
relevant to local community settings. Further, Putnam’s 
(2000, 48-64) empirical description of trends in civic en- 
gagement in the United States indicate that the Verba 
and Nie data were collected near the peak level of 
such engagement, when its effects on the democratic 
process should have been especially evident. Finally, 
the richness of the Verba and Nie data set allows us to 
carry out especially appropriate empirical tests of the 
expectations discussed above. 

The expectations about civic engagement and repre- 
sentation that are summarized above imply the follow- 
ing specific hypotheses about mass—elite policy agree- 
ment that are testable with the Verba and Nie data on 
these communities. 


H1: The higher the membership in bridging social- 
capital organizations in a community, the higher 
the agreement on the policy agenda between the 
mass public and the elite. 

H2: The higher the membership in bonding social- 
capital organizations in a community, the lower 
the agreement on the policy agenda between 
the mass public and the elite. 

H3: Higher active membership levels in bridging 
and in bonding social-capital organizations in 
a community should lead to especially higher 
and lower agreement, respectively, between the 
mass public and the elite on the policy agenda. 

H4: Levels of bridging-group and bonding-group 
civic engagement will interact with levels of vot- 
ing participation in communities to produce es- 
pecially high levels and low levels, respectively, 
of agreement between the mass public and the 
elite on the policy agenda. 


DATA AND MEASURES FOR THE 
HYPOTHESIS TESTS 


Most of the data employed here come from the large, 
multipart effort to examine political behavior and 
representational processes in American communities 
that culminated in the publication of Participation in 
America. In that project a representative sample of 
the American public was interviewed in 1967 about a 
wide range of their political attitudes, the degree to 
which they engaged in various forms of conventional 
political behavior, and their assessment of political is- 
sues in their communities. Further, 64 communities 
with populations projected to be under 50,000 were 
chosen for oversampling so that estimates of average 
community mass attitudes and participation could be 
derived (Verba and Nie 1972, 345-49). Samples of 
political elites chosen by positional status were inter- 
viewed about their attitudes and participation in these 
64 communities. The interview team also collected con- 
siderable data on the demographic attributes of these 
communities and on various political and nonpolitical 
organizations there. 

The dependent variable in our analyses is Verba 
and Nie’s measure of mass—elite policy “concurrence,” 
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or agreement between the mass sample and the elite 
sample in each community on the policy agenda for 
that community. This measure comports closely with 
one of Oliver’s (2001, 205), Putnam’s (2000, 338-47), 
and Berry, Portney, and Thomson’s (1993, 101-14) ma- 
jor assertions about the democratic consequences of 
civic engagement—that it increases communication be- 
tween the mass and the elite. Along with these au- 
thors, we assume that higher scores on this measure 
are also associated with better representation of mass 
preferences in local government policy. This assump- 
tion is widely adopted, too, as reflecting one avenue by 
which such policy representation may be achieved—in 
analyses of dyadic representation between legislators 
and their constituents (e.g., Miller and Stokes 1963, 
50-51), dynamic representation of mass preferences in 
national government policy (Stimson, MacKuen, and 
Erikson 1995, 545-46), and partisan representation 
across Western industrial democracies (Powell 2000, 
248). Substantial empirical support also exists for this 
assumption, especially in the extensive dyadic repre- 
sentation literature ing from the original work 
of Miller and Stokes (1963) to the most recent paper 
on this subject by Hurley and Hiil (2003). 

The mass samples in individual communities used 
to create this measure ranged in size from 15 to 30, 
although the vast majority of communities had 22 or 
more such respondents. The elite sample was composed 
of the highest locally elected public official, the highest 
elected county government official, the highest official 
of the largest local Democratic and Republican party 
organizations, the head of the local Chamber of Com- 
merce (or a near-equivalent private executive if there 
was no Chamber of Commerce), the editor or publisher 
of the local newspaper, and the highest official in the 
local school board.? 

Both the mass and the elite samples were asked 
open-ended questions about the most important prob- 
lems facing their communities and the responses were 
used to calculate the concurrence scores (Verba and 
Nie 1972, 412-14). “Component” concurrence scores 
were computed separately for each mass respondent 
with replies from the members of the elite sample 
about: (1) the elite respondents’ own perceptions of 
the most important problems, (2) the elite respondents’ 
perceptions of what the mass public viewed as most 
important problems, and (3) which problems the elite 
respondents themselves had been attempting to solve. 

As an example, suppose a citizen stated that the 
crime rate was one of the most important community 


2 Both the mass and the elite samples are doubtless small, raising 
a possible concern about our ability to validly estimate mass—elite 
agreement in these communities We have replicated our principal 
analyses, however, for subsets of the communities with relatively 
larger and smaller numbers of mass respondents—producing the 
same hypothesis test outcome (eg. to accept or reject the null) 
and reasonably similar regression coefficients for all the theoretically 
important explanatory variables. Hansen (1972, 223, 230; 1975, 1187) 
presents a series of analyses that demonstrate remarkable milari- 
ties m levels and patterns of mass—elite agreement across different 
subsets of the full elite sample (e.g., government officials vs. private- 
sector individuals) and, thus, elite groups of varying stzes. 
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problems. If all the members of the elite sample in his or 
her community also said crime was such a problem, the 
mass respondent would be scored 100 on that response 
(or 50 if half the elite agreed, and so on). If all of 
the elite sample also perceived this to be an especially 
important problem for the public in their community, 
the respondent would be scored 100 on the second 
component. If all the elite respondents said that this 
was a problem they themselves were actively working 
on, the respondent would be scored 100 on the third 
component. Mass respondents were allowed to name 
up to two problems that were of particular importance 
in the community, and both responses were scored as 
outlined here. 

The above “component” scores computed for each 
mass respondent’s named policy problems were then 
averaged, to create a single, overall concurrence score 
for that individual. Then all these individual concur- 
rence scores were averaged for the mass respondents 
in a community to create a community-wide, mass— 
elite policy agreement measure. Higher scores on this 
aggregate measure indicate higher overall mass—elite 
agreement on community policy problems. (Descrip- 
tive statistics on this and all the variables analyzed in 
the paper are given in the Appendix.) 

In response to the concern that individuals might 
agree about a policy problem but have different pref- 
erences for its solution, Verba and Nie (1972) specifi- 
cally observe that “the problems mentioned were most 
often those for which agreement on the selection of a 
problem would imply agreement on the direction, if not 
the details, of a solution. Citizens and leaders complain 
about shortages of community facilities, job shortages, 
traffic, crime, pollution” (328). 

For our most important explanatory variables, for 
bridging and bonding civic engagement, the Participa- 
tion in America data allow us to create measures that 
comport closely with the concepts specified above.’ 
Mass respondents were queried about membership and 
active membership in each of a long list of types of or- 
ganizations. To categorize particular types of organiza- 
tions as bridging or bonding ones, we adopted the gen- 
eral conceptions and particular examples in Putnam 
(2000, 22-24). Thus we categorized fraternal organi- 
zations, labor unions, fraternities, sororities, farm or- 
ganizations, study clubs, and professional societies as 
bonding groups. We categorized service clubs, veter- 
ans’ organizations, political organizations, sports asso- 
ciations, school service organizations, and hobby orga- 
nizations as bridging groups. 


3 These data and the other mass-survey items used to create mea- 
sures for our analyses are taken from the data set that Verba and Nie 
archived with the ICPSR under study number 7015. It was necessary, 
however, for the data reconstruction effort explamed m Hill 2004, 
to discover which c community each mass respondent resided 
in—informaton not included in the archived data set. 

4 We did not code church-group membership into either type of or- 
ganization because Putnam (2000, 22) observes that some religious 
associations can be charactermed as bridging, and some as bond- 
ing, soclal-capital organrzations. Further, the survey question about 
church-group membership used in the Participation in Amenca study 
did not discriminate among the many kinds of such organizations that 
respondents might report engagement with 
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We recognize that some associations provide a 
degree of both bridging and bonding engagement. 
Putnam (2000, 23) himself observes that these groups 
should be judged for relative degrees of each kind 
of social-capital experience. Thus there is some im- 
precision in any effort to associate particular groups 
with only one type of social capital, and some ob- 
servers may believe different classifications of particu- 
lar groups would be preferred. But we have followed 
Putnam’s general and specific conceptions as closely 
as possible, and we report below detailed, supplemen- 
tary analyses that support our coding of groups into 
categories. 

We calculated two original kinds of measures of 
community engagement in these organizations, and we 
report hypothesis test results for both kinds to demon- 
strate the generalizability of our findings) We judge 
the first set of measures to be the theoretically most 
important because they comport not only with the 
distinctive characterizations of bridging and bonding 
groups but also with the discussion in Putnam (1993, 
171-76) of the importance of the density of networks of 
civic engagement across an entire community or polit- 
ical system. These measures also reflect Granovetter’s 
(1973, 1373-76) conceptualization of the density of in- 
dividuals’ social networks in a community—some of 
which are based in civic associations, which he styles as 
having benefits for individuals and the community that 
are very similar to Putnam’s construction of bridging 
and bonding social capital. 

Conceptually, the density of civic engagement refers 
to how extensive association memberships are across 
both mass respondents and a set of organizations in 
a community. To measure this concept of density, we 
follow Granovetter’s examples of how the density of 
individual-level social networks can be extended to 
assess such density at the community level. Thus the 
operational definition, for example, of the density of 
active membership in bridging organizations is the 
total number of active bridging-group memberships 
reported by all the respondents in a community, di- 
vided by the product of the number of possible types 
of bridging organizations multiplied by the number 
of respondents. Higher scores on this measure indi- 
cate communities where more mass respondents are 
linked with each other in more bridging groups. Based 
on this conceptualization, we have created measures 
of the density of nominal and active membership 
in both bridging and bonding social capital organi- 
zations. 

The second set of measures of civic engagement 
replicates generally those used in Putnam (2000, 48- 
64) and Oliver (2001, 20, 216-17), who did not have 
sufficient data to assess the theoretically preferred con- 
cept of density of engagement and thus measured sim- 
ple, nominal, or active membership rates. One example 
measure here, then, for nominal membership in bridg- 
ing groups, is the percentage of mass respondents in 
a given community who reported nominal member- 
ship in one or more of the types of bridging groups. 
We created comparable percentage measures for nom- 
inal bonding-group engagement, active membership in 
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bridging groups, and active membership in bonding 
groups. 

Hypothesis 4 about how levels of civic engagement 
might interact with levels of mass participation in elec- 
tions requires a measure of voting participation at the 
community level. Our measure of this concept is taken 
from Verba and Nie’s (1972, 56-81) intensive analy- 
sis of the “modes of participation,” which explored 
the underlying structure and dimensions for a large 
number of discrete measures of different types of mass 
participation in politics. Their principal components 
analysis of these data uncovered a separate dimension 
of voting participation at the individual level. Verba 
and Nie then aggregated the factor score measure for 
this dimension at the individual level into a measure 
of mean community-level voting participation for their 
own analysis of mass—elite policy concurrence. We em- 
ploy the same community-level measure in our analy- 
Ses. 


Other political and socioeconomic attributes of com- 
munities besides civic engagement, of course, may in- 
fluence the degree of mass—elite policy agreement. 
Thus our analyses control for the theoretically and 
substantively most notable rival explanations of this 
form of democratic representation. First, abundant re- 
search assumes that such representational linkages are 
more successful when the participation of the mass 
public in politics is higher. Verba and Nie (1972, 47- 
48), recall, articulate some of the rationales for this 
common expectation—because higher mass participa- 
tion generally increases pressure on the elite to respond 
to mass preferences, and some forms of participation 
contribute information about the nature of those pref- 
erences. 

Thus our analyses control for the rival effects of all 
the “modes” of political participation that Verba and 
Nie (1972, 56-81) uncovered—that is, for community- 
level, mean voting participation, particularized con- 
tacting, and campaign participation.® Prior research 
on representative democratic politics in various kinds 
of polities, and by Verba and Nie (1972) and Hansen 
(1972, 1975) specifically on the communities under 
study here, suggests that voting participation should be 
a particularly important influence on mass—elite policy 
apreement. 

Second, considerable prior research also demon- 
strates that more competitive elections provide the 
mass public more choice among leaders and policy 
alternatives and lead to greater mass influence over 
actual policy (e.g., Dahl 1971, 23-26). Thus we control 


* The measure of voting particrpation level by community is the 
original one created by Verba and Nie as opposed to the version 
they subsequently developed that 1s “corrected” for the effects of 
mass respondents’ levels of socioeconomic status as a control for a 
rival explanation of mass—elite policy agreement. We control for rival 
explanatory variables like this directly m our analyses, but replication 
tests of our analyses with Verba and Nie’s corrected measure of vot- 
ing participation produce hypothesis test results for civic engagement 
identical to those reported here. 

© We do not control for the measure of communal participation 
created by Verba and Nie because ıt amounts to an aggregate of 
our separate measures of bridging and bonding social-capital avic 
association membership. 


for the effects of the competitiveness of local elections 
with an original measure of that concept created for 
this analysis. That measure takes account of the facts 
that local elections in American communities may be 
legally partisan or nonpartisan and that the type of 
elections held in communities also varies with the form 
of local government operating there. Our scale of com- 
petitiveness is principally based on an ordinal measure 
of the frequency with which elections for the head of 
the local government are competitive, whether based 
on partisanship or not. This information was derived 
from original interview data from community elite in- 
formants. A dozen communities, however, do not have 
scores on the latter measure because there is no single 
elected head of local government—which would be the 
case under some variants of council—manager and com- 
mission forms of government. Competitiveness scores 
for the latter communities are based on additional elite 
interviews with local political party leaders—in which 
they reported the frequency with which the two major 
political parties both generally endorsed local candi- 
dates for elections.’ The final measure of election com- 
petition, then, is a four-point ordinal scale that takes 
account of competitiveness in these r 

One other control variable was selected for its 
prominence in research on democratic governance: the 
population size of the communities. Dahl and Tufte 
(1973) offer an extended theoretical argument for why 
smaller political systems—which they argue are often 
especially socially homogeneous too—should demon- 
strate more successful democratic governance. Fur- 
ther, Oliver (2001, 33-67) advances much the same 
argument specifically for American communities. Our 
measure of the population size of the communities is 
an original estimate for 1967, the year in which the 
survey data were collected. It is calculated by linear 
extrapolation of the change in population between the 
decennial censuses for 1960 and 1970 (U.S. Bureau of 
the Census 1973 and earlier years). 

Finally, we add two control variables because of their 
substantive prominence in various analyses of demo- 
cratic politics in America. First, we control for the 
percentage of the population of communities that is 
ethnically non-White or, more specifically, for the per- 
centage of the population that is either Black, “other 
races” by U.S. Census criteria, or Hispanic and non- 
White, data that were collected originally for the anal- 
yses here (U.S. Bureau of the Census 1973). Diversity 
based on race or ethnicity has, of course, created 
unusual challenges for the democratic process in 
America—especially in the period when our data were 
collected. Finally, because of the distinctive cultural 
character of former Confederacy states and the as- 
sociation of that culture with nondemocratic political 
routines, we include a dummy variable for those com- 
munities located in the South. 


7 Tf both parties report always endorsing local candidates, as one 


example, the community was coded as having high electoral com- 
petition. Further, analyses with an alternative measure of election 
competition for that subset of communities with elected heads of 
local government produced identical hypothesis test results in our 
various models. 
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TABLE 1. 
Agenda Agreement 
Independent Variable 


Voting participation 

Campalgn participation 

Particularized contacting 

Electoral competition 

Percentage population non-White 
Population size 

Southem community 

Density of bonding-group membership 
Density of bridglng-group membership 


Percentage members of one or more 
bonding groups 
Percentage members of one or more 
bndging groups 
Constant 
Ad| R? 
Note. Cell entnes are 


No other rival explanatory variables that we could 
add to the reported models either have significant rela- 
tionships with agenda agreement, change the findings 
with respect to civic engagement, or have sufficiently 
compelling theoretical rationales in our view to require 
that they be added to the models.® 


HYPOTHESIS TEST RESULTS 


Table 1 presents OLS regression models for tests of hy- 
potheses 1 and 2 about the effects of nominal bridging- 
and bonding-group membership on mass—elite policy 
agreement. The first model in the table employs mea- 
sures of the density of bridging- and bonding-group 
membership, and the second model employs the mea- 
sures based on percentages of membership in one or 
more of the bridging- and bonding-groups for respon- 
dents in a given community. The hypothesis test results 
are quite striking. They are also consistent across both 
kinds of measures and are not the product of any mod- 
eling specification problem that we can detect. 
Contrary to hypothesis 1, neither measure of mem- 
bership in bridging social capital organizations is sig- 
nificantly related to mass—elite policy agreement. Yet 
membership in bonding social capital associations is 
quite strongly and negatively associated with mass— 
elite agreement—and with both measures of such civic 


8 We tested as additional predictor vanables measures of median 
family income, change in population size over recent years, isolation 
of the community from large population centers, mcome inequality 
among those mass respondents who were interviewed by Verba and 
Nie, and ethnic diversity among those same mass respondents. 
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Model 2 
068" 


(18.016) 


16.458 
30 


regression coefficients and, in parentheses, 
= Significant at <.01 “ Significant at <.05. * Sigruficant at < 10. n= 64. 


engagement. Thus there is notable support here for 
hypothesis 2 and for the particular, negative charac- 
terization Putnam and others have offered of bonding 
social capital groups, but not, of course, for hypothe- 
sis 1, about the positive effects of bridging groups or 
for more general expectations of how civic engagement 
enhances democratic governance. 

We note that the other results in Table 1 con- 
form generally to expectations. The theoretically most 
prominent predictors—for voting participation, elec- 
tion competition, and community size—are significant 
as hypothesized. The remaining control variables, how- 
ever, are not significant. Further, none of the results 
in Table 1, especially those for the civic engagement 
variables, are the product of collinearity among the 
predictor variables—for which we ran various tests 
and supplementary analyses, the selection of variables 
included in the models, deviant cases, the univariate 
distributional properties of the predictor variable, or 
unusual functional forms in the relationship of civic 
engagement with mass—elite agreement.” The negative 
relationships of bonding-group civic engagement with 
mass—elite agreement and the null relationships with 
bridging-group engagement are both replicated, too, 
when we estimate models that include only a measure 
of bonding- or of bridging-group engagement alone. 

Further, the results in Table 1 for civic engagement 
summed across all types of bonding groups and all types 


9 As examples of the evidence that indicates little collinearity among 


the predictor variables, none of the measures of civic engagement in 
either Table 1 or Table 2 has a variance inflation factor >2.1, and all 
but one of the variance inflation factors are <2.0. 
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TABLE 2. Active Civic Assoclation Membership and Mass-Elite Policy Agenda 


Agreement 
Independent Varlable 


Voting participation 

Campaign participation 
Particularized contacting 
Electoral competition 

Percentage population non-White 
Population size 


Southem community 


Denstty of active bonding-group membership 
Density of active bndging-group membership 


Percentage active members of one or more bonding groups 


Percentage active members of one or more bridging groups 


Constant 
. R? 
Note: Cell entries are 
at < 05 * Significant at <.10 n=64. 


of bridging groups are compatible with the results of 
analyses for individual types of groups in these two 
broad categories. That is, comparable regression mod- 
els with engagement measures for single types of bond- 
ing groups (e.g., fraternal organizations alone) pro- 
duce generally comparable relationships with mass- 
elite agreement.!° Comparable models with civic en- 
gagement measures for single types of bridging social- 
capital groups, and with various subsets of such groups, 
all produce insignificant relationships with mass—elite 
agreement. These analyses provide convergent evi- 
dence for the construct validity of our reported mea- 
sures of civic engagement and indicate that our results 
are robust to the aggregation required to construct 
general measures of bonding- and bridging-group civic 
engagement—which is necessitated by the distinctive 
theoretical expectations for these two categories of 
gro 

Table 2 reports results for hypothesis 3 about how 
active engagement with civic associations should have 
stronger relationships with mass—elite agreement than 
nominal membership alone. Active membership in 
bridging social-capital associations, and by both mea- 
sures of such engagement, is, however, unrelated to 


10 Individual types of bonding groups with relatively larger mean 
membership levels, for which it 1s feasible that community-wide 
effects might be realized, have negative relationships with mass— 
elite agreement. Groups with very small mean membership levels, 
for which community-level effects of their presence are less likely, 
do not demonstrate effects on concurrence alone. When the mem- 
bership percentages for these small groups are summed, however, 
their combined size is also negatively and significantly related to 
mass—ehite agreement. 





14.616 
-27 


regression coefficients and, In parenthesse, standard errors “* Significant at < 01. * Signrhcant 


policy agreement levels, just as is the case for nominal 
membership in such groups. The density of active en- 
gagement in bonding social—capital groups, in contrast, 
is negatively associated with mass—elite agreement, al- 
though the measure for active “percentage” member- 
ship in one or more bonding groups is not significant. 
Thus the finding here on the density of active 
bonding-group membership replicates generally the 
finding for the density of nominal bonding-group mem- 
bership in Table 1—and with a somewhat larger re- 
gression coefficient as was hypothesized. But because 
the second measure of active membership in bonding 
groups (based on the percentage of respondents who 
are members of one or more groups) is not significant, 
there is only mixed support for hypothesis 3. We specu- 
late, however, that the relatively small numbers of mass 
respondents who report active civic engagement could 
explain why such activity demonstrates limited to no 
community-wide effects like the one investigated here. 
Finally, Table 3 reports regression models for the 
test of hypothesis 4 about the possible interaction of 
civic engagement, an information-providing mass ac- 
tivity, with voting participation, a pressure-inducing 
mass activity. We tested such models for all of the 
measures of civic engagement employed above—for 
both of the alternative measures of nominal and ac- 
tive membership in both bridging and bonding social- 
capital groups—using the same base regression models 
as reported in Tables 1 and 2. None of the analyses 
with bridging groups resulted in significant interac- 
tions, while three of the four for bonding groups pro- 
duced negative interactions as hypothesized. Thus, for 
parsimony we report in Table 3 only the interaction 
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Independent Vanable 
Voting participation 


Campaign participation 
Particularized contacting 
Electoral competition 
Percentage population non-White 
Population size 
Southern community 
Density of bonding-grp. membership 
Density of active bonding-grp members 
Members In one or more bonding grps. 


Active members In one or more bonding grps. 


* Significant at <.10 n=64 


models for the four measures of membership in bond- 


Ing groups. 

In Table 3 the negative interactions between vot- 
ing participation and the density of nominal bonding- 
group membership, the density of active bonding- 
group membership, and active membership in one or 
more bonding groups all pass the conventional F-test 
for statistical significance in comparison with the rel- 
evant restricted models. (The interaction test for the 
fourth measure, the percentage of respondents in a 
community who are nominal members of one or more 
bonding groups, just misses statistical significance as 
shown in Model 3.) Thus in all three of these models 
there is a substantial, negative, conditional influence of 
bonding-group membership on mass—elite agreement 
through the interaction with voting participation. 

This conditional relationship merits elaboration. As 
noted above, the effect of voting participation on 
mass—elite agreement is theoretically interpreted as 
a democratic-process mechanism. ‘Thus the significant 
interactions here mean that higher levels of bonding- 
group membership moderate—or degrade—this 
democratic linkage. The conditional effects are sizable 
too, and they can be demonstrated by conventional 
procedures for “decomposing” the full interaction re- 
gression models (Friedrich 1982). ‘That is, one can use 
components of the full interaction model to demon- 
strate how the effect of voting participation on mass— 
elite concurrence is moderated at different levels of 
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TABLE 3. The Interactlon between Bonding-Group Clvic Assoclation Membership and Voting 


Participation and Its Relationship with Mass-Eltte Policy Agenda Agreement 
Model 1 


Interaction of the bonding grp. measure and 306 
vote participation (.511) ; 
Comparable bndglng-grp measure 4.982 18.654 224 —1.783 
(17.791) (25 198) (5.684) (7.083) 
Constant 16.641 14.494 16 694 14.026 
Ad]. R? -33 .30 28 23 
F-test for Increase In R? over base model 3.24" 3.19° 2.7 3.6* 


Se ee TE DASI a a UUR e a le 
Note Cell entries are regression coefficients and, in parentheses, standard errors. “* Significant at <.01. ™ Significant at <.05. 
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Model 2 Model 3 Model 4 






bonding-group membership (estimates most appropri- 
ately made, too, within the range of typical values of 
bonding-group membership). 

Recall that the coefficient for voting participation 
in any full interaction model in Table 3 indicates the 
effect of voting participation on concurrence when 
the bonding-group measure in that model equals zero. 
Yet by appropriately decomposing the full interaction 
equation and solving for the desired values, one can 
estimate the regression coefficient for voting participa- 
tion on concurrence at any specific value of bonding- 
group membership and the t-value associated with 
that coefficient. For example, when the density of 
nominal bonding-group membership is relatively low, 
specifically at one-half a standard deviation below the 
mean of such membership levels across the communi- 
ties (within a range where many cases are observed), 
the estimated coefficient of voting participation on 
mass—elite agreement is 0.088, with a t-value of 3.906 
(Friedrich 1982, 804-20). In contrast, when the density 
of nominal bonding-group membership is relatively 
high, specifically at one-half a standard deviation above 
its mean (also a point where many cases are observed), 
the coefficient for voting participation is only 0.054, 
with a t-value of 2.212. At both values of bonding-group 
membership there is a positive effect of voting turnout 
on mass—elite agreement, but there is a substantially 
weaker effect of voting participation at higher levels of 
bonding-group membership. 
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CONCLUSIONS 


Theory-generating discussions of civic engagement 
have been dominated by a general, overarching expec- 
tation that more of such engagement is better for the 
health of democratic processes. By necessity, because 
of data availability, existing theory-testing scholarship 
has only investigated such general relationships as well. 
That empirical scholarship has also been restricted 
to the examination of policy outcomes only diffusely 
linked to literal representational processes, Yet many 
theoretical discussions of civic engagement have recog- 
nized that distinct forms of such participatory activity 
exist, such as for bridging and bonding social—capital 
organizations, and the latter kinds of organizations are 
expected to have radically different consequences for 
democratic processes. 

We have provided what we believe is the first analysis 
of the relationships of different forms of civic engage- 
ment with democratic responsiveness. We find no sup- 
port for the hypothesized positive effect of bridging- 
group membership, but we uncover strong support 
for two negative effects of bonding-group membership 
on representation. One of these effects is the direct, 
negative association between higher bonding-group 
membership—measured in either of two alternative 
ways—and mass—elite policy agenda agreement. The 
second, also theoretically expected, effect is in the neg- 
ative interaction where higher bonding-group member- 
ship depresses the salutary effect of voting participa- 
tion on representation. Thus we provide considerable 
evidence for the concerns of many scholars that civic 
engagement has a “dark side” for democracy (Fiorina 
1999) and no support for the positive view of such en- 
gagement. 

Our findings on bridging groups will doubtless be dis- 
appointing to many. Why these groups have no effect 
on mass—elite agreement, however, is not clear. Per- 
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Varlable 

Mass—elite policy agenda concurrence 

Mean vote participation score for community 

Mean campaign participation score for community 

Mean partcularized contacting score for community 

Electoral competition 

Percentage of population non-White 

Population slze ın 1967 

Southem community 

Density of nominal bonding civic engagement 

Density of nominal bridging civic engagement 

Density of active bonding civic engagement 

Density of active bndging civic engagement 

Percentage of communtty’s respondents nominal 
members of one or more bonding groups 

Percentage of communtty’s respondents active 
members of one or more bonding groups 

Percentage of community’s respondents nominal 
members of one or more bridging groups 

Percentage of communrty’s respondents active 





TABLE A1. Descriptive Statistics for the Operational Measures In the Reported Analyses 
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haps the numeric scope of their membership in the typi- 
cal community is insufficient (although bridging groups 
are not more limited in this respect than are bonding 
groups). Perhaps the typical breadth of engagement of 
different social groups, classes, ethnic groups, and the 
like is not sufficient for such groups to speak effec- 
tively for the community at large. Our analyses cannot 
resolve this concern, so it remains a task for future 
research. 

It is valuable to remember, however, what our find- 
ings do not imply. Our results are unambiguous for 
community-level consequences of different forms of 
civic engagement. But they should not be interpreted to 
mean that such engagement is unimportant for individ- 
uals. ‘Thus the genesis of different kinds of civic engage- 
ment and their consequences at the individual level re- 
main notable topics for research. Indeed, such research 
may demonstrate associations between civic engage- 
ment and theoretically important, individual-level po- 
litical attitudes and behavior (e.g., Brehm and Rahn 
1997, Uslaner and Conley 2003, and Verba, Schlozman, 
and Brady 1995). 

As might be said of the results of any single study, 
the findings here beg for replication and extension. 
Of particular interest would be analyses that explicate 
more fully the causal linkages between both individual- 
level and group-level bonding social—-capital behavior 
and representational outcomes. Absent such research, 
however, one implication of our findings is clear. We 
should temper the remarkable and often unqualified 
belief in the value of civic engagement that has in- 
spired much of the research discussed at the beginning 
of this article. Yet this is not an observation of despair. 
Our results confirm that conventional democratic pro- 
cesses of citizen participation in competitive elections 
are important determinants of representational out- 
comes. Perhaps our faith and our applied work should 
be invested more in stimulating these processes. 





Mean Standard Deviation 


Range 

















12.8 5.7 1.9-26.7 
0.68 33.7 —B89.2-58.7 
~3.2 26.7 —46.5-57.6 
2.2 21.6 ~33.4-45.7 
2(a) 3(b) 1—4 
11.1 16.5 .08—60.8 
15,256 16,553 312-63,911 
of 1? 0-1 
08 04 02-.18 
09 05 01-21 
04 03 01-.13 
06 03 00-.14 
42 16 13-77 
25 14 04-.61 
38 15 04.74 
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Madison’s Opponents and Constitutional Design 
DAVID BRIAN ROBERTSON University of Missouri—St. Louis 


nderstanding what James Madison’s opponents sought and won at the U.S. Constitutional 

Convention revises our understanding of the founders’ original intentions for the durable 

framework that has structured American political development. The Constitution is the by- 
product of expedient accommodations forced on Madison. Madison sought broad national authority 
independent of state governments and a swift victory for population-based congressional representation. 
Delegates from economically disadvantaged states opposed these plans, seeking instead to nationalize 
only selective public goods, to maintain most state policy autonomy, and to minimize contingencies 
imposed by other governments. Connecticut's delegates, particularly Roger Sherman, played a pivotal 
role in spoiling Madison’s agenda and altering his substantive plans for Constitutional design. Madison’s 
Convention opponents are responsible for a Constitution that nationalized only enumerated public goods 
and imposed potentially high transaction costs on any further nationalization of policy authority. They 
helped make federalism a lasting political weapon used to win substantive policy outcomes. 


quires a clear understanding of the founders’ 

reasons for designing the Constitution the way 
they did. The Constitutional Convention of 1787 built 
the basic framework for making public policy in the 
United States. The Convention delegates understood 
that they were establishing the vital ground rules for 
future political battles over the control of the national 
government. As it turned out, the basic constitutional 
framework has endured, accompanied by unending dis- 
putes about the original intentions of its architects. That 
basic framework shaped the development of American 
politics for two centuries. It is shaping the development 
of American politics now. 

James Madison’s compelling Federalist essays left 
the mistaken impression that the Constitution’s final 
design closely matched his own ideas for American 
government, and that these ideas essentially shaped 
the document. But Madison’s Virginia Plan, presented 
at the start of the Constitutional Convention, was very 
different from the Constitution he later urged the states 
to ratify. Initially, Madison placed a high priority on 
giving the national government very broad powers to 
tax, to regulate all commerce and currency, and to veto 
state laws. Initially, his proposals aimed to sever the 
connection between state governments and national 
policy-makers and to allocate seats in the Senate as 
well as the House of Representatives in proportion 
to state population or wealth. But the Constitution 
signed on September 17, 1787, differs from Madison’s 
initial plan on all these important dimensions. The final 
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Constitution vested the national government with 
much less authority than Madison wanted and allowed 
the states much more authority than Madison thought 
safe. It gave state governments considerable influence 
in national policy-making and an equal share of power 
in the Senate. Even as he was preparing a spirited pub- 
lic defense of the Constitution in the fall of 1787, in 
private correspondence Madison was expressing disap- 
pointment and pessimism about the Convention’s final 
product (PJM 10: 163-64, 209-14; see Hobson 1979, 
217; Matthews 1995, 15). 

Madison’s disappointments resulted from political 
compromises on Constitutional design forced on him 
by Convention opponents whose goals, effectiveness, 
and impact are far too little understood and appreci- 
ated. Scholars who analyze the workings of the Con- 
vention have stopped short of providing a systematic 
explanation of the opposition that Virginia’s propos- 
als provoked. Instead, the Convention often is pre- 
sented as a rather disjointed sequence of individual 
design choices structured by the Virginia agenda, with 
each choice sparking a discrete conflict aligning differ- 
ent delegates in different ways. Analysts usually have 
drawn a simplified portrait of the Virginia Plan’s op- 
ponents, often a portrait overshadowed by a broad 
interpretation of the early American republic rather 
than the actual Convention itself. For many recent 
historians, for example, the disputes in Philadelphia 
marked an episode in America’s continuing struggle to 
define and implement republican ideals (Wood 1969). 
An older historical tradition, still lively in the social 
sciences, has interpreted the Convention’s disputes as 
an early manifestation of the chronic conflict between 
economically advantaged elites and their less privi- 
leged adversaries (Beard 1913; McGuire 2003). In all 
cases, authors have tended to present Madison and 
his allies as a coherent band of sophisticated nation- 
alists challenged by a relatively formless, parochial, 
somewhat reactionary, and largely leaderless opposi- 
tion. But opponents of the Virginia Plan’s provisions 
included some of the shrewdest and most experienced 
republican politicians in the world. This group could 
be expected to include at least a few individuals with 
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a consciously different strategy for Constitutional re- 
form, and the will and ability to fight for it with some 
success. A more systematic account of Madison’s op- 
position can provide a more thorough explanation for 
Madison’s numerous political failures in Philadelphia, 
for the Constitution’s final design, and for its subse- 
quent impact on American politics. 

Who opposed Madison’s ideas for Constitutional re- 
form at the beginning of the Convention? What were 
they trying to achieve? How well did they achieve 
it? Delegates from the economically disadvantaged 
states that lie between Virginia and Massachusetts 
composed the core of the opposition to the Virginia 
Plan. They mobilized against the coalition of six large 
and southern states that Madison expected to unite 
behind his initial agenda. These delegates sought to 
strengthen the central government selectively, nation- 
alizing only a narrow set of public goods, while de- 
fending most of their state’s policy autonomy and its 
equal influence in the national policy-making process. 
Connecticut’s delegation played a pivotal role in this 
Opposition, and Roger Sherman, a skilled Connecti- 
cut politician, emerged as Madison’s most relentless 
antagonist. As they defended their states’ interests, 
the delegates from Connecticut, New Jersey, Delaware, 
and Maryland substantially reshaped Madison’s initial 
blueprint. During the course of the meeting, they de- 
railed Madison’s agenda, united a cohesive opposition 
against broad national powers, won delegates over to 
an innovative theory of dual national and state au- 
thority, and earned important victories for state pol- 
icy autonomy and influence. They turned the process 
of Constitutional design into a series of interdepen- 
dent political negotiations, lasting three and a half 
months. 

Understanding what Madison’s opponents sought 
and won in Philadelphia revises our understanding of 
the enduring framework that has structured Ameri- 
can politics since 1789. The Constitution was largely 
an unanticipated by-product of politically expedient 
compromises rather than the product of a single plan. 
With pragmatic imprecision, the delegates constructed 
an unfinished national policy workplace and stocked it 
with a limited range of policy tools, expecting future 
politicians to use and add to these tools in pursuit of 
their own political agendas. Nothing better exempli- 
fies this thoroughly political nature of Constitutional 
design than dual state and national sovereignty, a con- 
ceptual innovation pressed by Connecticut’s delegates 
over Madison’s objections. The Constitution’s distribu- 
tion of national and state authority represents a series 
of political accommodations between the two sides. 
The delegates nationalized only a limited set of enu- 
merated public goods, established strong political de- 
fenses for the states’ existing advantages, and imposed 
high transaction costs on the further nationalization 
of public goods. Federalism was designed the way it 
has been used ever since, as an opportunistic political 
battlefield with ambiguous boundaries, one that makes 
it possible to displace substantive policy conflicts with 
perpetual quarrels over the boundary between state 
and federal power. 
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THE POLITICAL LANDSCAPE OF THE 
PHILADELPHIA CONVENTION 


The states sent politicians to represent them in 
Philadelphia, not philosophers, professors, or investors. 
Most of the delegates had served in state legislatures, 
offices, or courts. Many of the most active participants, 
such as James Madison and Roger Sherman, had made 
politics a career. Literally, they were state builders who 
had helped design new republican constitutions for 
their states, built new governing institutions, filled new 
public offices, and assembled working political coali- 
tions (McDonald 1958; 1979, 258-70; 1985, 185-224; 
Rossiter 1966, 79-156). Many were unusually conver- 
sant with leading theorists of statecraft and republi- 
canism. By 1787, their state governments had survived 
war and depression as sovereign political entities that 
levied taxes, circulated currency and notes, and regu- 
lated debts, vices, economic associations, public safety, 
and the electoral franchise. 

Many an American politician had a vital stake in 
his state’s independent control over public policy. In 
practice, these republican political leaders could not 
separate public policy from politics. Public policy con- 
structed political alliances and support, while political 
contests turned on the control of the policy-making 
process and its outcomes. They had learned to calibrate 
tax, currency, debt and regulatory laws to create dis- 
tinct and dynamic political republican regimes (on us- 
ing policy to construct political orders, see Skowronek 
1982, 1997; Bensel 2000). They were inventing and us- 
ing policy tools to balance the conflicting demands of 
democracy, order, economic stability and economic de- 
velopment (John 1997; Main 1973; Novak 1996). Like 
organization builders more generally, they placed a 
priority on the survival and autonomy of the states 
they had nurtured along (on organizational behavior, 
see Thompson 1967). By early 1787, however, Shays’s 
Rebellion, state currency policies, and other problems 
in state governance expanded political support for far- 
reaching national government reforms (Brown 1993; 
Matson 1996, 382-84). The delegates shared with a 
number of constituents a deep anxiety about the na- 
tion’s political future, and were willing to consider seri- 
ously substantial increases in specific national powers. 
Though they expressed concern for economic stabil- 
ity, development, investment and property rights, these 
politicians were not simply the agents of wealthy con- 
stituents (Ferguson 1961, 251-86; McDonald 1958, 25, 
37, 86-92). In theory, more effective national gover- 
nance of currency, commerce, and military security 
would provide valuable public goods for their con- 
stituents. In practice, though, any specific addition to 
national power might be used to the detriment of one 
or more states. 

The delegates’ central political dilemma, then, was 
to reconstitute the national government so it could 
provide the national public goods they believed nec- 
essary, without endangering the vital interests of their 
constituents and the polities they had built. Each state 
legislature charged its delegates to render the na- 
tional government “adequate to the Exgencies of the 


American Political Science Review 


Union.” There was widespread agreement that the na- 
tional government needed a steady stream of income 
and some specific additional powers to deal with na- 
tional security, civil unrest, commerce, and currency 
problems. Because the states had different economic 
assets (McCusker and Menard 1985), cultures, histo- 
ries, policy commitments, and political equilibria, how- 
ever, each delegation specifically defined “adequate” in 
a different way. Delegates from New Jersey and Con- 
necticut advocated the nationalization of tariffs and 
currency while they defended other state prerogatives 
(RFC August 16, I, 309; August 28, II, 439). Several 
delegates, including Wiliam Paterson, George Mason, 
and John Dickinson, hoped to ban the slave trade (RFC 
June 9, I, 561; August 22, I, 370, 372). While Luther 
Martin fiercely resisted the expansion of many national 
powers at the Convention, he also insisted on nation- 
alized control over public lands (RFC August 30, I, 
464). Each delegate made his own evaluation about 
the benefits and risks of any specific national power for 
his own state and the nation (see also McGuire 2003 
and Siemers 2002). 

Designing the Constitution required three types of 
choices (Robertson 2005). These choices can be dis- 
tinguished analytically, but the delegates did not make 
them in sequence or independently of one another. The 
first group of choices involved policy agency, that is, 
who would fill national offices, how these offices would 
be filled, and on whom officeholders would depend. 
The second group of choices involved national policy 
authority, that is, the specific policy areas in which the 
national government would exercise authority and the 
conditions placed on the exercise of this power. Ex- 
amples of these choices included national authority to 
regulate interstate and intrastate commerce, lay taxes, 
govern currency, build infrastructure, and protect con- 
tracts and property. The third group of choices involved 
the national policy process, that is, the rules for formu- 
lating, legitimating, and implementing and adjudicating 
public policy, including specific provisions for enacting 
legislation and exercising the veto. 

It is impossible to specify with precision how philo- 
sophical principles affected these Constitutional design 
choices. No delegate disputed the wide support for 
republican ideals, for example, but these ideals pro- 
vided little help for making virtually any of the specific 
design choices the Constitution required. Republican- 
ism specified little beyond the ultimate sovereignty of 
the people and the principle of separation of powers. 
Specific decisions about the selection of policy-makers, 
their authority, and the national policy process could 
not be settled decisively by examples from imagined 
republics or existing states. None of the new state 
republics produced fully acceptable results (Kruman 
1997, 123-26; Wood 1969, 433). Nothing in republi- 
can theory per se specified the precise distribution of 
powers and selection of policy-makers in the final Con- 
stitution. 

Historian Jack Rakove (1996) observed that “what 
is elusive is the interplay between ideas and inter- 
ests” (15). Rakove, political scientist Calvin Jillson 
(1988), and other authors have argued that the del- 
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egates veered back and forth, clashing over ideas at 
some points and over interests at others. It is not possi- 
ble, of course, to penetrate the thinking of any delegate 
and distinguish the way each weighed principle against 
interest on each design issue. The high-minded defense 
of national authority may have masked a delegate’s 
driving ambition for the prestige and power of national 
office (most of the signers went on to hold elected or 
appointed offices in the new government). On the other 
hand, an individual’s defense of state prerogatives may 
reflect a mix of not only parochialism but also prag- 
matism, sincere loyalty to the principle of constituent 
representation, and a deeply held belief in the superi- 
ority of the social, economic, and political order in any 
given state. By expanding the concept of interest be- 
yond personal pecuniary gain and selfish parochialism, 
to include the representation of any state’s autonomy 
and economic interests, it is much easier to see that the 
delegate’ ideas and interests aligned closely with one 
another. 

The delegates frequently invoked philosophical prin- 
ciples in the Convention debates, but these claims 
almost always dovetailed with each delegate’s con- 
stituents’ interests. His constituents’ interests colored 
each delegate’s arguments about the true nature of 
the nation’s interests. Political calculations, conceived 
broadly in this way, shaped delegates’ views of the 
stakes in most of these design choices. Political cal- 
culations and negotiations, rather than isolated philo- 
sophical syllogisms, settled the disputes these choices 
engendered. James Madison skillfully tried to structure 
the Convention agenda to align the ideas and interests 
of a majority of state delegations with his plan. 


JAMES MADISON’S AGENDA FOR 
CONSTITUTIONAL DESIGN 


The delegates’ anxiety about the nation’s future, com- 
bined with the hazy status of specific solutions, opened 
a window of political opportunity for James Madison. 
He prepared for the Convention more thoroughly than 
any other delegate (Banning 1995; Rakove 1996, 46- 
56, 59; Robertson 2003). As a member of the Confed- 
eration Congress, Madison had supported stronger na- 
tional revenue and commercial powers. He approached 
the Convention with a much bolder agenda for reform 
and a strategy for getting the Convention to adopt it. 
Madison sought three kinds of far-reaching changes 
in national government design. First, Madison sought 
to transfer to the national government the complete 
authority over taxes, commerce, and other basic tools 
of economic policy. Madison had become convinced 
that the states had too much authority over economic 
policy, and their abuse of that authority was harming 
the nation’s interests and its republican experiment. 
Much like Adam Smith, Madison believed that gov- 
ernment had an important but limited role, primar- 
ily facilitating trade and market-driven economic de- 
velopment (Fleischacker 2002; Matthews 1995, 314). 
The states’ debt, currency, and trade policies were in- 
hibiting free markets and the nation’s prosperity. In 
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letters to George Washington, Thomas Jefferson, and 
Virginia Governor Edmund Randolph, Madison in- 
sisted on the establishment of a fully sovereign na- 
tional government, “armed,” he told Randolph, “with 
a positive & compleat authority in all cases where uni- 
form measures are necessary” (PJM 9: 370; 317-22, 
382-87). The delegates took it for granted, he claimed, 
that the U.S. national government should “have powers 
far beyond those exercised by the British Parliament 
when the States were part of the British Empire” (RFC 
June 29, I, 464). National officials’ ambitions should be 
driven by a material concern for national advantage 
and nothing else. Madison reasoned by analogy that, 
if state policy authority resulted in the incentive to 
pursue parochial interests, expanded national authority 
could give national government officials the incentive 
to pursue purely national interests. To the end of the 
Convention he argued that complete authority over the 
nation’s commerce, including intrastate commerce, was 
indispensable (RFC August 28, Il, 442; September 15, 
I, 625). Madison also believed that taxation should be 
nationalized, on the grounds that a “compleat power of 
taxation” was “the highest prerogative of supremacy” 
(RFC June 28, I, 447) and “no line could be drawn 
between” authority to regulate trade and authority to 
levy taxes” (RFC August 13, II, 276). Madison also 
sought broad national authority to govern the militias, 
to charter corporations, to develop canals and other in- 
frastructure projects, to found a national university, and 
to establish patents and copyrights (RFC August 18, 
II, 324-25, 332; August 23, O 332; September 14, II, 
615). To anchor this broad national authority, Madison 
proposed the extraordinary national power to veto 
state legislation at will, asserting at the Convention that 
“an indefinite power to negative legislative acts of the 
States” was “absolutely necessary to a perfect system” 
(Hobson 1979; PJM 9: 383-84; RFC June 8, I, 164-68, 
171-72; June 19, I, 319; July 17, I, 28). 

Second, Madison’s basic plan aimed to minimize the 
state governments’ role in choosing national policy- 
makers. Madison believed it essential to avoid “too 
great an agency of the State Governments in the Gen- 
eral one ...” (RFC June 6, I, 134). The Virginia Plan 
proposed that the voters directly choose the members 
of the House of Representatives, the foundation of the 
national policy-making system. The House then would 
elect the Senate from slates proposed by the states, 
and the national legislature would choose the national 
executive and judiciary. Madison described this scheme 
as a “policy of refining the popular appointments by 
successive filtrations” (RFC May 31, I, 49-50), and 
it would disconnect national policy making from the 
states, stabilize popular control of government, and 
deliver better policy outcomes for the nation. The third 
basic element of Madison’s plan was tactical as well as 
substantive: The Convention would have to establish 
what he termed “proportional representation” (that is, 
apportioning seats in the national legislature according 
to population size) as the first order of business, Propor- 
tional representation in both houses of Congress would 
further nationalize policy, by reducing the influence of 
state governments as units of representation, and it 
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would substantially increase the influence of Madison’s 
own constituents by giving Virginia the largest delega- 
tion in Congress. 

Tactically, Madison depended on a swift victory for 
proportional representation to bind together a win- 
ning Convention coalition of six states. The three lar— 
gest states, Massachusetts, Pennsylvania, and Virginia, 
would gain from proportional representation in the 
short run. Georgia and the Carolinas would gain in 
the foreseeable future as their rapidly growing pop- 
ulations eclipsed the eastern states (PJM 9: 318-19, 
369-70, 383). The success of this strategy depended on 
winning early agreement for proportional representa- 
tion in both chambers of the new legislature (Jillson 
1988, 47) and on delaying debate on the substance of 
national authority, particularly the divisive issues of 
tariffs, trade, and public lands. Once the Convention 
adopted proportional representation in the new na- 
tional legislature, this coalition would gel and sweep his 
remaining agenda through the Convention. He wrote 
hopefully to Jefferson that “if a majority of the larger 
States concur, the fewer and smaller States must finally 
bend to them. This point being gained, many of the 
objections now urged in the leading States ag[ain|st re- 
nunciations of power will vanish” (PJM 9:319 see also 
Lee and Oppenheimer 1999, 36-37). Virginia’s dele- 
gates gambled that smaller states could be prevailed 
upon “in the course of the deliberations, to give up 
their equality for the sake of an effective Government” 
(RFC May 28, I, 10-11, note). 

At the Convention’s first working session on 
May 29, Virginia Governor Edmund Randolph offered 
Virginia’s plan in 15 resolutions based on Madison’s 
ideas (Rakove 1990, 55; RFC LI, 525). The Virginia 
Plan laid out a sequence of decisions consistent with 
Madison’s tactics. It made the change from state-based 
representation to proportional representation the first 
order of business. The Virginia Plan’s first five resolu- 
tions established the rules for selecting legislators. Not 
until its sixth resolution did the Plan propose to autho- 
rize the national legislature “to legislate in all cases to 
which the separate States are incompetent, or in which 
the harmony of the United States may be interrupted 
by the exercise of individual Legislation; to negative 
all laws passed by the several States, contravening in 
the opinion of the National Legislature the articles of 
Union; and to call forth the force of the Union ag[ain|]st 
any member of the Union failing to fulfill its duty under 
the articles thereof” (RFC May 29, I, 20-22). 

Madison’s Convention strategy tacitly acknowl- 
edged that the Virginia Plan posed two threats to the 
state polities. First, by targeting the states’ authority 
to manage their economies, the plan imperiled states’ 
autonomy and existing state political regimes. Sec- 
ond, his proposals aimed to create a national govern- 
ment capable of systematically devaluing some state 
assets and extracting and redistributing others. The 
breadth of Madison’s plan to nationalize economic au- 
thority, then, enabled a wide range of opponents to 
unite around concerns about what government would 
do with that power. The very ambition of Madison’s 
plan increased its political vulnerability to delay and 


American Political Science Review 


dismemberment. To alter Madison’s design fundamen- 
tally, Madison’s Convention opponents had to slow the 
progress of his agenda and draw some of his expected 
allies away from his coalition. 


INTERPRETATIONS OF MADISON’S 
CONVENTION OPPOSITION 


Most scholars have not given the same careful, sys- 
tematic attention to the Virginia Plan’s opponents as 
they have to Madison and his Convention allies, such 
as Gouverneur Morris, James Wilson, and Alexander 
Hamilton.! The most familiar narrative of the Con- 
vention frames Madison and his allies as strong na- 
tionalists and the opposition as small states simply 
opposed to the loss of their equal vote in the Con- 
federation Congress. As John P. Roche (1961) put it, 
Madison’s “dedicated nationalists” squared off against 
“self-interested spokesmen of various parochial inter- 
ests” (800). Rakove’s Original Meanings, a peerless 
and penetrating analysis of the Convention’s politics, 
concluded that the “basic purpose” of the authors of 
the New Jersey Plan “was not to narrow the agenda 
of reform but to indicate that they would accept the 
broad changes of the Virginia Plan only if the small 
states retained an equal vote in one house of Congress” 
(Rakove 1996, 65). Many delegates themselves spoke 
in the terms of small and large states. “Give N[ew] 
Jersey an equal vote,” Charles C. Pinckney notably 
accused, “and she will dismiss her scruples, and con- 
cur in the Nat[ional] system” (RFC June 16, I, 255; see 
Roche 1961, 806). William Riker (1996) concluded that 
Madison and his allies indeed “had converted most of 
the moderate nationalists, like Roger Sherman, to their 
extreme position” (18). 

This conventional understanding of Madison’s Con- 
vention opposition, however, seriously misspecifies the 
interests of Madison’s opponents and, therefore, un- 
derestimates their impact on the Constitution’s design. 
It is more accurate to understand Madison’s opponents 
as delegates from the states outside of the coalition of 
six large and southern states he expected to form the 
core support for the Virginia Plan. This group of seven 
states outside his expected coalition—New Hampshire, 
Rhode Island, Connecticut, New York, New Jersey, 
Delaware, and Maryland—cannot be understood sim- 
ply as small states focused simply on defending their 
equal vote in Congress. First, the distinction based on 
size is misleading. South Carolina had a smaller popu- 
lation than Maryland or New York and about the same 
population as Connecticut. Georgia was one of the four 


1 Because this paper analyzes the design of the Constitution pro- 
duced by the Convention on September 17, 1787, it limits its evidence 
to the records of the Convention debates and excludes information 
from the subsequent debates over ratification. Ratification tracts, 
such as the Federalist Papers, were polemical essays written to urge 
support for ratification (Finer 1997, 3, 1495-96; Roche 1961) and 
provide a partisan account of the Convention’s design choices. James 
Madison’s notes on the debates provide the most complete and re- 
lable record of the Convention p The notes of some 
other of the other delegates provide reliable supplementary notes 
information (Hutson 1987). 
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least populous states. Second, several delegates outside 
of Madison’s expected coalition were strong support- 
ers of national reconstitution despite their opposition 
to many features of the initial Virginia Plan. Shlomo 
Slonim (2000) showed that the small states’ views were 
more complex than the conventional portrait, and they 
enjoyed more successes than equal state representation 
in the Senate. Four of the states most opposed to the 
Virginia Plan at the Convention ratified the Constitu- 
tion relatively quickly and by large margins: Delaware 
and New Jersey unanimously, Maryland by 85%, and 
Connecticut by 76%. Virginia, in contrast, ratified 
the Constitution with by only a 53% margin. Third, 
some of the delegates from whom Madison anticipated 
strong support came to oppose the final Constitution. 
Virginia’s George Mason and Massachusetts’ Elbridge 
Gerry, for example, refused to sign the document and 
opposed ratification. 

Finally, if Madison’s Convention opponents sim- 
ply sought equal representation in Congress, then the 
states outside of his expected six-state coalition should 
have been assuaged after the acceptance of the bi- 
cameral (Connecticut) compromise on July 16 and less 
inclined to challenge Madison’s positions. Yet the del- 
epates from the four states represented throughout the 
Convention (Connecticut, New Jersey, Delaware, and 
Maryland) were more active in debate after July 16 
than before (Figure 1). These later debates often turned 
on issues of policy authority and relative institutional 
power instead of legislative apportionment. Slonim ar- 
gued that conflict continued because of philosophical 
disagreements about state sovereignty in the abstract. 
But the delegates were not struggling over abstract 
principle. Rather, they were battling over the relative 
position of their diverse state political economies, over 
the surrender of specific public goods to the national 
government, and over the states’ agency in determin- 
ing how national power would be used. Given that 
different delegates from these states defined adequate 
national powers in different ways, Slonim’s (2000) con- 
clusion that “these delegates opposed every attempt to 
engulf the states in a tidal wave of national authority” 
(19) greatly oversimplifies their political motivations, 
constituency interests, tactics, and influence on the out- 
come. 

By distinguishing three groups of delegates at the 
Convention instead of two, Thornton Anderson (1993, 
62) greatly furthered our understanding of those del- 
egates who opposed the Virginia Plan but embraced 
the final Constitution. Anderson’s first group was the 
“nationalist” bloc, including Madison and such allies 
as James Wilson and Gouverneur Morris. The second 
group, “state’s rights proponents,” included Maryland’s 
Luther Martin and John Francis Mercer and New 
York’s Robert Yates and John Lansing, all of whom 
left the Convention and opposed the ratification of 
the Constitution. Anderson labeled the third group 
“state federalists.” These 15 “state federalists” occu- 
pied a middle ground on state powers. He labeled as 
“state federalists” all of the delegates from Connecticut 
as well as John Dickinson from Delaware. The cate- 
gory also includes a number of delegates from states 
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FIGURE 1. 
Maryland Delegates at the Convention 
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Madison counted on to support his plan, including all 
four of the South Carolina delegates, two from North 
Carolina, and three delegates from large states who 
refused to sign the Constitution (Elbridge Gerry of 
Massachusetts and George Mason and Edmund 
Randolph of Virginia). Anderson’s typology identi- 
fies the need for a closer analysis of those “state fed- 
eralists” outside Madison’s expected coalition. Given 
Madison’s experience, skill, and preparation, it is im- 
plausible to believe that Madison was politically naïve 
when he included the southern “state federalists” in 
his expected coalition. The more likely explanation 
is that his political strategy broke down during the 
Convention and that some of these “state federalists” 
from outside of Madison’s expected coalition caused a 
critical mass of expected supporters within it to defect 
from key parts of his plan during the course of the 
Convention. 


THE ECONOMICALLY DISADVANTAGED 
STATES 


Madison initially may have been encouraged by the 
weaknesses of his potential opponents. Because New 
Hampshire’s delegates did not arrive until July 23 and 
Rhode Island was not expected to send a delegation 
(Hutson 1987, 67), Madison’s anticipated coalition of 
six states could have outvoted the five remaining states 
during the Convention’s first two months. The two New 
York delegates who walked away from the Convention 
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Speeches, Motions, and Seconds from Connecticut, Delaware, New Jersey, and 





left their state without a sufficient number to cast an 
official vote on behalf of their state during final three- 
fifths of the Convention’s working sessions. The com- 
plete absence of Rhode Island and the long absences of 
New York and New Hampshire placed the burden of 
opposing Madison squarely on four states represented 
throughout the Convention. These delegates proved to 
be superbly adept at defending their political interests. 

Virginia’s plan particularly sparked opposition from 
specific states south of Massachusetts and north of 
Virginia—Connecticut, New Jersey, Delaware, and 
Maryland—because these states had substantial eco- 
nomic and political disadvantages compared to their 
neighbors. They lacked the expanses of land and the 
long growing season enjoyed by the states to their 
south. They also lacked the lucrative ports of Boston, 
New York, and Philadelphia and were very vulnera- 
ble to exploitative tariffs and regulations from nearby 
states with these large ports. The architecture of the 
Confederation had leveled the playing field somewhat 
for these states because their equal vote in the Con- 
federation Congress substantially offset their relative 
economic disadvantages. With limited resources and 
significant economic exposure, these states actually had 
the most stake in nationalizing specific public goods 
such as coastal defense and tariffs, in blocking other 
states from restricting commerce, and in enforcing the 
states’ financial obligations to the Confederation (on 
the problem of collective action in the Confedera- 
tion, see Dougherty 2001). These states could not ade- 
quately provide such public goods as national defense 
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and needed protection from economic damage spilling 
over from the tariffs, paper emissions, and credit poli- 
cies of contiguous states. But these states’ relatively 
limited economic assets and their vulnerability to larger 
neighbors made it even more vital for them to defend 
their remaining economic prerogatives and political 
advantages forcefully. New York was a special case. 
New York faced the surrender of its lucrative tariff 
revenues to the national coffers. Because these tariffs 
allowed the state to export some of the costs of its gov- 
ernment, it potentially confronted the largest political 
sacrifice of any state (De Pauw 1966, 31-32). 

The representatives from Connecticut, New Jersey, 
Delaware, and Maryland formed the core of Madison’s 
Convention opposition. Delegates from these four eco- 
nomically disadvantaged states generally shared three 
goals for Constitutional design. First, they sought to 
maintain their state government’s control over taxes, 
internal commerce, regulation, and other policy tools 
critical for managing republican politics and develop- 
ment inside their borders. Second, they sought to mini- 
mize contingencies imposed by other governments, in- 
cluding negative spillovers from the tariff and currency 
policies of neighboring states and harmful policies that 
might be pursued by a stronger national government. 
Third, they sought to maintain their economic capa- 
bilities and economic position relative to other states. 
They feared not only the superior resources of other 
states, but also future relative gains by these priv- 
Ueged states, gains that could allow other states to 
benefit themselves at the expense of less advantaged 
neighbors. Grieco (1990, 27-50) argued that nation- 
states in international relations should not be under- 
stood simply as atomistic actors seeking to maximize 
gain. Independent nation-states are motivated not only 
to protect against the absolute gains of other states 
and compliance with any international agreement, but 
also to protect their relative position. Greico termed 
these nation-states “defensive positionalists.” The del- 
egates from the economically disadvantaged Ameri- 
can states acted as “defensive positionalists” at the 
Convention. 

These delegates claimed that Virginia’s plan would 
create not only absolute gains for states in Madison’s 
coalition, but also relative gains for these states com- 
pared to those states outside his coalition. Madison’s 
strategy raised understandable suspicions among these 
unsentimental political realists) Delaware’s Attorney 
General, Gunning Bedford, Jr., characterized 
Madison’s Convention strategy explicitly and darkly 
during the Convention’s most bitter day of debate: 


If political Societies possess ambition avarice, and all the 
other passions which render them formidable to each 
other, ought we not to view them in this light here? Will not 
the same motives operate ın America as elsewhere? If any 
gentleman doubts it let him look at the votes. Have they not 
been dictated by interest, by ambition? Are not the large 
States evidently seeking to aggrandize themselves at the 
expense of the small? They think no doubt that they have 
right on their side, but interest had blinded their eyes. Look 
at Georgia. Though a small State at present, she is actuated 
by the prospect of soon being a great one. [South] Carolina 


is actuated both by present interest & future prospects. 
She hopes too to see the other States cut down to her 
own dimensions. [North] Carolina has the same motives of 
present & future interest. Virg[inia] follows. Mary[land] is 
not on that side of the Question. Pen[nsylvania] has a di- 
rect and future interest. Mass[achusetts] has a decided and 
palpable interest in the part she takes. Can it be expected 
that the small States will act from pure disinterestedness. 
(RFC June 30, I, 491) 


THE PIVOTAL ROLE OF CONNECTICUT 
AND ROGER SHERMAN 


Connecticut’s delegates led the opposition, relentlessly 
disputing Madison’s most important Convention pro- 
posals. Connecticut also became the pivotal delegation 
by taking a key role in framing and then defeating 
the New Jersey Plan, pressing for the “Connecticut” 
compromise on House and Senate apportionment, 
providing northern support for protecting the slave 
trade, and injecting state agency into presidential se- 
lection. The concept of pivotal delegation draws on 
Congressional scholars’ concept of pivotal legislators 
(Brady and Volden 1998; Krehbiel 1998; Rodriguez and 
Weingast 2003). At the Constitutional Convention, the 
pivotal unit was the state delegation instead of indi- 
vidual delegates, because delegates from any state cast 
votes together as a single unit, as in the Confederation 
Congress. 

In the 85 working sessions of the Convention af- 
ter the presentation of the Virginia Plan on May 29, 
Connecticut’s three delegates spoke, made motions, or 
seconded motions 265 times—more than the contribu- 
tions of the five delegates from Maryland (119 times), 
five from Delaware (103), and five from New Jersey 
(38) combined (Table 1). Thorton Anderson’s (1993) 
factor analysis confirms that Connecticut played a dis- 
tinctive role in the Convention (207-14; see also Ulmer 
1966). 

Roger Sherman led an unusually cohesive and 
pragmatic three-man Connecticut delegation (Rakove 
1996, 86). At the Convention, Sherman was the most 
active spokesman among the Virginia Plan’s oppo- 
nents. Madison and Sherman took positions explic- 
itly opposed to one another on 39 occasions. Sherman 
spoke, made motions, or seconded motions 160 times, 
much more often than John Dickinson or Maryland’s 
Luther Martin and 10 times more often than any del- 
egate from New Jersey. Madison himself spoke, made 
motions, or seconded motions 177 times. Sherman’s 
talented younger colleague, Oliver Ellsworth, greatly 
admired Sherman and looked to him as his model. 
Sherman and his protégé Ellsworth rarely expressed 
differences (though when they did, Ellsworth was will- 
ing to provide marginally more power to the national 
government than was Sherman; RFC August 9, I, 
232; August 23, II, 385). William Samuel Johnson, who 
was somewhat more open to national authority than 
Sherman, had been Sherman’s legal mentor (Collier 
1971, 234). Thirty years older than Madison, Sherman 
came to the Convention with a formidable political 
reputation as a leading American statesman. He had 
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TABLE 1. Convention Speeches, 
Motions, and Seconds of Motions by 
Delegates from Four Continuously 
Represented States Outside 
Madlson’s Expected Six-State 
Coalition 
Connecticut total 

Roger Sherman 

Olrver Ellsworth 

Willlam S. Johnson 
Delaware total 

John Dickinson 

George Read 

Jacob Broom. 

Gunning Bedford, Jr. 

Richard Bassett 
New Jersey total 

Jonathan Dayton 

Willlam Paterson 

David Brearty 

Willlam C. Houston 

William Livingston 
Maryland total 

Luther Martn 

Daniel Carroll 

John F. Mercer 

James McHenry 

Dan’! of St. Thomas Jenrfer 





served on the congressional committees that wrote the 
Resolves of 1774, the Declaration of Indepen- 
dence, and the Articles of Confederation. Only 
Sherman signed all three of these documents and 
the Constitution. In the mid-1780s Sherman was 
serving—simultaneously—as mayor of New Haven, a 
member of the Connecticut’s Council of Assistants 
(which functioned as the state senate), a judge on the 
Connecticut Superior Court, and a delegate to the Con- 
federation Congress. 

Sherman had strong intellectual differences with 
Madison and aimed to achieve a much narrower 
agenda of reform. He advocated as much direct, equal- 
state agency in national policy-making as feasible, 
as little nationalization of public goods as neces- 
sary, and as much state control of everyday economic 
management as possible. While he also supported 
certain stronger national powers, Sherman preferred 
that the scope of national authority should be ex- 
panded in a much more limited and specific ways. 
No one argued earlier and more forcefully against 
the conventional wisdom that sovereignty could not 
be divided. Instead, he argued for the idea of dual 
sovereignty, insisting that the state and national gov- 
ernments should have discrete areas of policy re- 
sponsibility (RFC May 30, I, 34-35; June 6, I, 133; 
June 7, I, 150). He conceded that the Confederation 
government required additional powers, “particularly 
that of raising money” and the national management 
of currency and interstate commerce, both pressing 
interests for a state squeezed by Rhode Island’s paper 
currency and New York’s tariffs. Circumstantial evi- 
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dence indicates that he supported national authority to 
assume the states’ debts (Currie 1997, 77; RFC August 
18, IL, 326-68, August 21, II, 355-56). States should 
be prohibited from issuing currency or interfering with 
contractual obligations. But Sherman insisted from the 
start that states should retain control of “matters of 
internal police... wherein the general welfare of the 
United States is not affected” (Collier 1971, 230-33; 
RFC May 30, I, 34). While Madison believed that the 
burden of proof for retaining economic policy author- 
ity rested on the states, for Sherman, the burden for 
absorbing such authority rested on the national govern- 
ment. Sherman argued, for example, that states should 
retain even the power to embargo goods. A plan he 
wrote early in the Convention would have retained the 
Confederation system of requisitions, which allowed 
the states to fund the national government through rev- 
enues of their choosing. Congress should be the agent 
of the states, he believed, with national policy-makers 
ultimately selected and paid by the state governments 
(Collier 1973, 10, 47, 61, 68, 71-73; RFC LL, 439-40; II, 
615-16). Madison ally Rufus King thought Sherman fa- 
vored “a sort of collateral Government” (RFC June 6, 
I, 142-43). 

Roger Sherman’s direct confrontations with James 
Madison demonstrate Sherman’s influence on the Con- 
stitution’s design. Madison lost many battles at the 
Convention. By Forrest McDonald’s (1985) count, “Of 
seventy-one specific proposals that Madison moved, 
seconded, or spoke unequivocally in regard to, he 
was on the losing side forty times” (208-9). Table 2 
lists the 39 occasions on which Sherman and Madison 
took explicitly conflicting positions at the Conven- 
tion. Three of these disagreements involved dif- 
ferences of interpretation that did not bear on a 
specific issue under consideration. Seven resulted in 
compromises between their positions. Of the remain- 
ing 29 disputes, the Convention ultimately adopted 
Sherman’s position 19 times and Madison’s position 10. 
Conspicuously, the Convention chose Sherman’s posi- 
tion over Madison’s more frequently as the Conven- 
tion wore on. After a Committee of Detail produced 
a draft Constitution on August 6, the Convention took 
Sherman’s position 13 times and Madison’s only twice. 

Sherman’s leading role among Madison’s Conven- 
tion opponents becomes clearer in comparison to 
role of prominent delegates such as Delaware’s John 
Dickinson, Maryland’s Luther Martin, and New 
Jersey’s William Paterson. Dickinson, a more re- 
nowned national figure than Sherman, could not par- 
ticipate during important stretches of the Convention 
because of his health (Hutson 1983). Like Sherman, 
Dickinson also strongly supported equality of state 
votes in the Senate, but he was much more supportive 
of Madison’s expansive vision of national authority. 
Dickinson explicitly agreed with Madison more often 
than he disagreed with him. While he agreed with 
Madison on 10 occasions, including the defeated na- 
tional veto of state laws, he disagreed with Madison 
on only seven occasions, and of these disagreements, 
the final outcome reflected his position only four 
times (Sherman agreed in two instances and took 
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TABLE 2. Explicit Confilcts between James Madison and Roger Sherman at the Constttutlonal 


Convention 
Date 
(1) May 31 
(2) Juned 
(3) Juned 
(4) Juneé6 
(5) June 7 
(6) June 12 
(7) June 12 
(8) June 13 
(9) June 26 


(10) June 28 


(11) June 29 


(12) July 2 
(13) July 10 
(14) July 13 
(15) July 14 
(16) July 17 
(17) July 18 
(18) August 7 


(19) August 13 
(20) August 15 


(21) August 16 
(22) August 17 


(23) August 18 
(24) August 20 
(25) August 21 
(26) August 23 


(27) August 23 
(28) August 24 


(29) August 28 


(30) August 28 


(31) August 30 
(32) August 31 


(33) August 31 


(34) September 8 
(35) September 8 
(36) September 12 
(37) September 14 


(38) September 15 
(39) September 15 


Issue 
Popular election of House 
Popular ratification 
Infertor federal courts 
Direct election of House 
State legislatures elect Senators 
House term 
Senate term 
Congrees appoints Supreme Court 
Senate term 
Proportional representation in the 
House 
Faults of Confederation 
Commit representation dispute 
to committee 
Double size of House to 
130 members 
Direct taxation determined by 
number of Representatives 
Moderate plan for proportional 
representation In Senate 
National veto of state laws 
President appoints judges 
Fix Congress meeting date 
States contro! citizenship rights of 
naturallzed citizens 
Jolnt veto exercised by president & 
Judges 
National authortty to tax exports 
Congress authority to “declare” 
rather than “make” war 
Broad national mliia authority 
Broad definition of treason 
National authority to tax exports 
National appolntment of top militia 
officers 
Commit national veto of state laws 
Jolnt House—Senate ballot for 
electing president 
Absolute prohibition on state 
em 
Absolute prohibition on state tariffs 
Spectfy U.S. clalms to lands 
States required for ratification 


Ban states from requiring duties 
for throughbound ships 
Senate tnal of Impeachments 


Increase number of Representatives 


Reduce veto override to 2/3 

National authority to cut canals 
and charter corporations 

National authority over commerce 


Specify congressional power to 


Madison 
Position 


For 
Against 


7 with a majority 
of population 


Agalnst 


Against 
For 
Against 
For 


Against 


Sherman 
Position 


[Interpretive 


dispute] 
Against 


Compromlse 


Sherman 
Sherman 
Shennan 


Sherman 
Madison 


Sherman 


Compromise 
Compromlse 
Compromlse 


Sherman 


Sherman 
Sherman 
Sherman 
Sherman 


Sherman 


delegate Inferior appointments 
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no position in the other two).? Maryland’s querulous 
Attorney General, Luther Martin, arrived two weeks 
late and left two weeks early. While Martin defended 
state prerogatives more stridently than any other dele- 
gate, he proved strikingly ineffective in translating this 
antagonism into concrete results. Martin took positions 
on 43 items acted on by the delegates, but only four 
of these positions successfully translated into a sub- 
stantive provision in the final Constitution. Moreover, 
only two of these four “wins” for Martin’s position con- 
stituted “losses” for Madison’s: the defeat of the na- 
tional veto of state laws and the defeat of an executive 
veto exercised in conjunction with other national of- 
ficers (although a majority of Martin’s Maryland col- 
leagues voted for it). A third Martin win was significant 
but uncontroversial. Immediately after the defeat of 
the national veto, Martin made the motion that in- 
jected the New Jersey Plan’s supremacy clause into the 
Constitution. No delegate dissented (RFC July 17, Il, 
27-29; July 21, I, 76-80). 

American politics textbooks often give New Jersey’s 
William Paterson the premiere role as an opponent 
of the Virginia Plan, and like Madison himself, these 
texts use the terms “New Jersey Plan” and “Paterson 
Plan” interchangeably (Cummings and Wise 2001, 41; 
RFC June 19, I, 315). But Paterson was neither the 
sole author of New Jersey’s proposal nor a particularly 
effective opponent of the Madison coalition. The New 
Jersey Plan resulted from the collaboration of those 
excluded from Madison’s expected coalition: the dele- 
gations from Connecticut, New York, New Jersey, and 
Delaware, along with Martin (O’Connor 1979, 145-47; 
RJC ID, 174, 178-79). Beyond introducing the plan on 
June 15 (it was defeated four days later), Paterson’s 
own contributions consisted almost entirely of five 
speeches defending small states and seven efforts on 
procedural issues that affected his state. He seconded 
only two substantive motions, one of which supported 
Madison ally Rufus King’s plan to select the president 
with state-designated electors (RFC June 19, I, 30). 
Although Paterson himself said that it was necessary 
to “mark the orbits of the States with due precision” 
(RFC June 9, I, 179), he contributed nothing to the 
proceedings after July 23, when the Convention dealt 
with the importation of slaves, the rules for making 


2 Dickinson agreed with Madison and disagreed with eather Sherman 
or Ellsworth on national inferior courts (RFC June 5, I, 125), direct 
election of the House (June 6, I, 136), the national veto (June 8, 
I, 167), a three-year term for House members (June 21, I, a 
limiting the national electorate to freeholders (August 7, I, 202), 
state provision of salaries for national legislators (August 14, IL, 292), 
the definition of rebellion against the United States (August 17, II, 
318), the definition of treason (August 20, I, 349), and the national 
taxation of exports (August 21, 361) He also supported Madison on 
the creation of an executive council (September 9, IT, 542). Dickanson 
opposed Madison on: the joint exercise of the executive veto (June 6, 
I, 140), the selection of the Senate by state legislatures (June 7, L, 153), 

i qualifications for federal office holders (July 26, 
I, 123), the ongmation of revenue bills m the House (August 13, II, 
278), the definition of felonies on the high seas (August 17, IL, 316), 
national government control of the militia (August 18, II, 331), and 
delegating an unresolved presidential election to the entire Congress 
instead of the Senate (September 5, I, 513). 
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commercial treaties, control over public lands, pres- 
idential and Senate powers, the amendment process, 
and many other controversial issues. 

It has been as easy to underestimate Sherman’s in- 
fluence on the Constitution’s design as it has been to 
overestimate Paterson’s. Unlike Madison, he left virtu- 
ally no written record of his Convention preparations 
or participation. His reported Convention speeches 
tended to be brief and pragmatic. He did not help 
write the Federalist papers, and his subsequent po- 
litical career pales in comparison to Madison’s. He 
lacked Madison’s talent for academic theory, James 
Wilson’s earnest support for popular government, and 
Gouverneur Morris’s talent for sound bites. But his 
engagement in Constitutional design was far-reaching. 
The ongoing, implicit discourse between Madison and 
Sherman is indicated by the strong day-to-day corre- 
lation between their speeches and motions (r=.565). 
The rivalry between Madison and Sherman gave the 
Convention “much of its drama and fascination,” ac- 
cording to Rakove (1996, 92). 

A close reading of Madison’s own Convention notes 
shows how Sherman put his formidable political skills 
to work for his agenda. Fellow politicians considered 
Sherman shrewd and “cunning as the devil” (RFC I, 
88-89; Rossiter 1966, 91). They credited him with such 
political coups as the acquisition of Connecticut’s West- 
ern Reserve in Ohio, an area physically the size of 
Connecticut itself (Boardman 1938, 160; Collier 1971, 
15, 146-48; PJM 9, 61). Sherman was the first dele- 
gate to inject concerns about political feasibility into 
the debates, warning the Convention against making 
“too great inroads on the existing system... by in- 
serting such as would not be agreed to by the States” 
(RFC May 30, I, 34-35). On some of the most critical 
Convention issues—representation in the Congress 
and presidential power and selection—Sherman skill- 
fully declared an impasse, urged that the issue be dele- 
gated to a committee, and then served on the commit- 
tee and shaped its solution (RFC July 2, I, 511; July 9, I, 
560; August 31, I, 481). These ad hoc committees took 
up the Convention’s most divisive design issues, forged 
a coalition of committee members (generally one from 
each state delegation) around mutually dependent de- 
sign agreements, and reported a plan that structured 
the subsequent voting agenda on these issues. In all, 
Sherman managed to serve on five key Convention 
committees that played a large role in settling many 
of the Convention’s most fundamental and sensitive 
political issues: congressional representation, the ap- 
portionment of House seats, the national assumption 
of state debts, the regulation of tariffs and ports, and 
presidential selection and powers (RFC July 5, I, 526; 
July 9, I, 652; August 18, II, 322; August 25, II, 418; 
August 31, I, 481). For the most part, the reports of 
these five committees became incorporated into the 
final Constitution. 

Sherman skillfully used rhetoric, timing and com- 
promise to manipulate the design agenda and alterna- 
tives (Riker 1996 termed these skills “heresthetics”). 
Sherman slashed at Madison’s abstractions with 
Ockham’s Razor, incisively questioning the need for 
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such proposals as the national veto, proportional rep- 
resentation in both houses of Congress, and special rat- 
ifying conventions for the new Constitution. Sherman 
used his adversaries’ own claims to frame issues in a 
way that weakened their resolve and united Madison’s 
opponents. By comparing states to individuals, for ex- 
ample, Sherman drew on the emotionally powerful and 
widely shared narrative of republicanism itself to de- 
fend equal state influence (RFC June 28, I, 450). While 
Madison reasoned that “extending the sphere” of gov- 
ermment would strengthen the national government by 
helping control factions, Sherman reversed Madison’s 
logic with the pithy equation that “[t]he small States 
have more vigor in their Gov[ernmen]ts than the large 
ones, the more influence therefore the large ones have, 
the weaker will be the Gov[ernmen]t” (RFC July 7, I, 
550). Sherman displayed a keen sense of political tim- 
ing, injecting a telling point or exploiting his opponents’ 
concessions at opportune moments. Most notably, 
Sherman proposed a bicameral compromise on repre- 
sentation at the start of the week’s business on Monday 
morning, June 11, an optimal moment for undercutting 
the Virginia Plan with an alternative agenda. He staked 
out positions diametrically opposed to Madison and 
then expressed a readiness to compromise on a middle 
ground, drawing the delegates to his apparently mod- 
erate position while making Madison’s position seem 
unwarranted or extreme (Collier 1971, 147; RFC June 
21, I, 359; June 28, I, 450; July 7, I, 550). Like Madison, 
Sherman pressed his agenda when he perceived an 
advantage and tacked toward conciliation when the 
political winds were less favorable (Collier 1971, 177; 
1973, 10, 47, 61, 68, 71-73; Rossiter 1966, 91). 

Madison’s Convention opponents changed his pro- 
posed design for the Constitution by derailing his po- 
litical strategy for the Convention and then altering 
his substantive plans for national authority and policy- 
making. As a result, Constitutional design emerged 
more as a by-product of many specific design choices 
than as a deliberate choice of one plan instead of an- 
other. The Convention’s choices were path-dependent, 
made in a sequence in which choices reflected political 
calculations set in motion by earlier choices and then 
affected subsequent political calculations (on path de- 
pendence, see Pierson 2000). 


SPOILING MADISON’S POLITICAL 
STRATEGY 


Madison’s attack on state influence in the national 
government immediately began to mobilize his op- 
ponents. On the first full day of debate, Madison 
groped for language that would commit the Conven- 
tion to population-based representation in both leg- 
islative houses. The Convention tabled the issue af- 
ter Delaware’s delegates invoked their instructions to 
protect equal state votes in Congress (RFC May 30, 
I, 34-35, 37, 42). The longer the delegates put off the 
resolution of the representation issue, the greater the 
opportunity for Madison’s opponents to raise pointed 
questions about the fairness and the consequences of 
proportional representation. Dickinson and Sherman 


struck at the heart of Madison’s Convention strategy 
by urging the delegates to focus on the specific au- 
thority required for the national government instead 
of abstract principles of representation in it (RFC 
May 30, I, 34-35, 42; June 7, I, 150). This tactic in- 
vited the delegates to factor the nation’s most divisive 
interstate conflicts into their decisions about the rule 
of representation. Sherman’s claim that “the objects 
of the Union... were few” lured Madison himself into 
discussing the acutely divisive race issue openly for the 
first time (RFC June 6, I, 133-36). 

Opposition to population-based representation in 
Congress was only a part of a broader defense of 
state agency in national policy-making. At every turn, 
Sherman pressed to enlarge the state legislatures’ 
influence over national policy-makers. He questioned 
the need for direct election of the House, selection 
of the Senate by the House, presidential veto power, 
creation of inferior national courts, and bypassing of 
state legislatures to ratify the Constitution. He insisted 
that the national executive be “absolutely dependent” 
on Congress (RFC June 1, 4-5, I, 65, 68, 99, 122; June 
5, I, 122, 125; June 6, 133; June 7, 151, 154). When 
Madison ally James Wilson warned that state officials, 
the principal “losers of power,” were the chief obstacles 
to popular control of Congress, Sherman portrayed his 
opponents’ position as radical (RFC June 6, I, 133). On 
June 7, Dickinson moved that members of the Senate 
be chosen by the state legislatures. Sherman seconded 
him. The motion forced Madison to concede that, if 
adopted, the provision would either require a huge 
Senate or would “depart from the doctrine of propor- 
tional representation” (in a footnote, Madison implied 
that the motion was meant to undermine proportional 
representation in the Senate). Nevertheless, the 
Convention gave the proposal unanimous approval. 
Virginia’s George Mason made the important con- 
cession that the state legislatures “ought to have 
some means of defending themselves” against the “en- 
croachments” of the national government (RFC June 7, 
I, 150-52). 

Madison’s Convention opponents provided increas- 
ingly precise and credible warnings about the threat 
to the rest of the states if the largest ones could 
dominate national policy. Gunning Bedford accused 
Virginia and Pennsylvania of conspiring to exercise 
“an enormous & monstrous influence” that would 
shape commercial and manufacturing policy (RFC 
June 8, I, 167-68, 172). David Brearley, Chief Justice of 
the New Jersey Supreme Court, then postulated that 
proportional representation would allow three states, 
Virginia, Pennsylvania, and Massachusetts, to “carry 
every thing before them in Congress” (RFC June 9, I, 
177). Sherman soon argued that, without equal state 
representation in at least one branch of Congress, 
“three or four of the large States would rule the others 
as they please” (RFC June 20, I, 343). Oliver Ellsworth 
hypothesized a commercial treaty enacted by three or 
four large states to benefit their ports at the expense 
of other states. He warned that in a Congress en- 
tirely based on proportional representation, “4 States 
will Gov[er]n 9 States ...” (RFC June 30, I, 484-85, 
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TABLE 3. Votes Cast In Opposition to the Virginia Delegation on 28 Contested 


Convention Votes, May 31—June 19 
CT N SC 
17 14 


Votes cast against 22 
Virginla’s postition 


Percentage 79% 61% 50% 46% 30% 36% 32% 20% 18% 14% 





I, 503). The more Madison and Wilson denied that 
the largest states would collude to dominate rest, the 
more the apparent Convention alliance of the Virginia, 
Pennsylvania and Massachusetts delegations belied 
that argument (see Table 1, McDonald 1985, 219, and 
RFC June 19, I, 321; June 28, 447-48). 

When Madison needed quick and decisive victories 
in the Convention’s first three weeks, Connecticut’s 
delegation fought him more than any other. Table 3 
shows the number of times a state cast a vote contrary 
to Virginia’s “aye” or “nay” vote on the 28 contested 
votes through June 19, when the delegates rejected the 
New Jersey Plan. The notion of contested votes at the 
Convention is inspired by the idea of Congressional 
Quarterly’s “key votes;” fixed rules were used to iden- 
tify the most closely divided votes? This table provides 
a rough approximation of opposition to Madison’s 
own preferences (as the Convention slowly progressed 
into July, Virginia’s vote occasionally departed from 
Madison’s preferences). Connecticut voted against 
Virginia most of the time. New Jersey, Delaware, and 
South Carolina voted against Virginia about half the 
time. South Carolina, which also sent a politically 
talented delegation, consistently backed Virginia on 
proportional representation, while it tended to vote 
against Madison in favor of protecting state agency in 
national policy-making. At the same time, the partner- 
ship of Virginia, Massachusetts and Pennsylvania on 
most votes could hardly be missed by friend or foe. 

The opposition congealed. On Monday morning, 
June 11, Sherman moved that suffrage in the first house 
be proportional to free inhabitants and that states be 
equally represented in the Senate. “[E]ach State ought 
to be able to protect itself,” claimed Sherman, throwing 
Mason’s concession back at Madison (RFC June 7, I, 
152-55; June 11, I, 196). When Sherman and Ellsworth 


3 The official Convention journal recorded 569 numbered votes. Of 
these, 205 met the following criteria for contested votes. First, these 
were votes in which no fewer than 25% of the states opposed no more 
than 75% of the states; when 10, 11, or 12 total votes were cast, at least 
three states voted “aye” or “no” against the majority, and when eight 
or nine total votes were cast, at least two states voted “aye” or “no” 
against the majority (a borderline vote was included as a contested 
vote; for example, Vote 5, a vote of six for, two against, and two 
divided, on the issue of popular election of members to the House, 
also was included). Second, these were politically substantive votes, 
procedural votes (such as adjournment or to commit a provision to 
committee) were excluded unless the record provided some sort of 
evidence that the vote was a proxy for a substantive issue Third, the 
votes had to be confirmed by two sources; because both Madison 
and the official Journal are sometimes inconsistent, all votes were 
excluded that were not confirmed by both sources or by one source 
and another delegate’s notes Twenty-eight contested votes occurred 
from May 29, when the Virginia Plan was presented, through June 19, 
when the New Jersey Plan was defeated. 
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pressed for a quick vote, Madison’s coalition of large 
and southern states turned back Sherman’s motion by a 
six-to-five margin. This ballot showed that Connecticut, 
New York, New Jersey, Delaware and Maryland had 
solidified into an opposition bloc instead of drifting 
individually to Madison’s position, as he had hoped. 
Four days later, Madison’s opponents introduced their 
collective “New Jersey” Plan, aiming to change the 
agenda by enumerating a limited number of specified 
public goods minimally necessary for ensuring ade- 
quate national power. The New Jersey Plan included 
most of the additional enumerated powers favored by 
Sherman and Dickinson, including authority to impose 
tariffs on foreign goods, to impose taxes on postage and 
to regulate interstate and international commerce. The 
plan added that national laws would “be the supreme 
law” and binding on the states, subtly a much more 
passive form of national authority that invited the na- 
tional courts, rather than Congress, to set limits on 
state policy discretion. Congress would continue as a 
unicameral body with equal representation for each 
state, would appoint a national executive for a single 
term, and would remove the executive if a majority of 
State executives petitioned for it (Hutson 1987, 85—86; 
O’Connor 1979, 145-47; RFC I, June 15, 242-45; I, 
178-79, 611-15). 

Connecticut’s delegates increased their political 
leverage as a pivotal delegation by voting against the 
New Jersey Plan they themselves had helped write 
(RFC June 19, I, 322). This vote made the Sherman 
and Dickinson bicameral compromise an appealing, 
practical, and necessary position halfway between the 
New Jersey and the Virginia plans. Mutual wariness 
increased as substantive economic issues, such as the 
Western lands, trade, and slavery, became indivisibly 
intertwined with representation. Connecticut’s dele- 
gates relentlessly emphasized the need to arm the states 
for political self-defense by providing them agency in 
national policy-making institutions (RFC June 25, I, 
401-3; June 29, I, 461, 466; June 30, I, 485-86, 490- 
92, 500). Individual delegates from the large states 
such as Mason, Pennsylvania’s Benjamin Franklin, and 
Massachusetts’s Elbridge Gerry were beginning to 
concede that argument and bend toward compromise 
(RFC June 25, I, 404, 407, 417; June 30, I, 488; July 2, I, 
514). Madison sounded a desperate note when he used 
slavery to try to redefine the Convention’s fundamental 
cleavage to protect the slave states and hold them in 
his coalition (RFC June 30, I, 486-87). 

As soon as Madison’s six-state coalition held to- 
gether to win a final vote on proportional representa- 
tion in the House of Representatives, Oliver Ellsworth 
moved to establish equal state representation in the 
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Senate (RFC June 29-30, I, 468-69, 484-85). The Con- 
vention deadlocked on Senate representation on July 2, 
with the three large states and the Carolinas in fa- 
vor, the five states involved in drafting the New Jer- 
sey Plan opposed, and Georgia divided. According 
to Luther Martin, Daniel of St. Thomas Jennifer, a 
Maryland ally of the Madison coalition, was absent 
when the vote was taken. Jennifer’s vote would have 
divided the Maryland delegation and decided the out- 
come in Madison’s favor. Rufus King asked that the 
vote be reconsidered after Jennifer arrived but was 
refused (RFC II, 188, note). Sherman declared that 
“we are at a full stop” and convinced the delegates 
that political expediency required a committee to iron 
out the differences. Madison strongly opposed the 
idea of a committee, but the Convention approved 
it anyway. Even a majority of the Virginia delegation 
abandoned him. The committee, stacked with Madison 
opponents and compromisers, initially included 
Ellsworth, Martin, Yates, and Bedford from the states 
outside Madison’s coalition and the conciliatory Gerry, 
Franklin, and Mason from the large states. Sherman 
managed to replace Ellsworth when the latter “was 
kept away by indisposition.” (EUsworth’s indisposition 
was providentially brief, perhaps suspiciously so, be- 
cause the delegates must have considered Ellsworth 
sufficiently fit to serve on the committee when it was 
elected at the end of the day on Monday July 2, and he 
attended when the delegates reconvened the follow- 
ing Thursday.) The committee proposed the bicameral 
compromise, later known as the “Connecticut compro- 
mise,” on July 5 (RFC June 30, I, 484-85, 492; July 2, I, 
511-16; July 5, I, 526-27, 532). 

Madison’s initial ambitions for the reconstituted gov- 
ernment’s design were demolished during the next 
eleven days. In this period, a few potential allies de- 
fected to the compromise plan for bicameral repre- 
sentation, while key southern delegates defected from 
Madison’s plan for broad national authority. 

Though the issue receives little attention in accounts 
of the Convention, the apportionment of the House 
of Representatives now played a major role in tip- 
ping the Convention’s balance against Madison. The 
delegates recognized that the first Congress would 
make portentous, path-setting decisions about “prin- 
ciple acts of government” (Gerry, RFC August 21, II, 
357), including tariff schedules, the disposal of state 
and national debts, the creation of new offices and 
courts, major appointments, and the location of the 
national capital. The committee that reported the bi- 
cameral compromise did not specify each state’s al- 
location of seats in the first House of Representa- 
tives; instead, it simply allocated one representative in 
the lower house for each 40,000 inhabitants, counting 
three-fifths of slaves. Madison ally Gouverneur Morris 
moved to specify each state’s apportioned seats in the 
first House. The delegates appointed a new committee 
of five to propose definite numbers. This committee 
included Morris, Nathaniel Gorham, and Rufus King of 
Massachusetts, Edmund Randolph of Virginia, and 
John Rutledge of South Carolina—all representing 
states in Madison’s coalition. The Convention put 
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off debate on Senate apportionment pending the report 
of the committee (RFC July 5, I, 533-34; July 6, I, 540, 
543). On July 9, this committee of five recommended 
a House with 56 seats: Virginia would have nine seats, 
Pennsylvania eight, Massachusetts seven, New York 
and the Carolinas five seats each, Connecticut and 
Maryland four each, New Jersey three, New Hampshire 
and Georgia two, and Rhode Island and Delaware 
one each. The report proved that Sherman’s and 
Elisworth’s warnings were right: Proportional repre- 
sentation could indeed put national power in the hands 
of as few as four states. Massachusetts, Pennsylvania, 
and Virginia together held 43% of the proposed 
House seats. New York, North Carolina, or South 
Carolina individually could play a pivotal role in cre- 
ating a four state coalition in the House that con- 
trolled a majority of the chamber’s votes (RFC July 9, 
I, 559). 

This proposal met with a swift backlash. Sherman 
challenged the report, pointing out that it “did not 
appear to correspond with any rule of numbers, or 
of any requisition adopted by Cong/ress].” He moved 
to refer the report to a committee of 11 members, 
one from each state present. Morris and Randolph 
quickly backed away from their own report, and the 
Convention overwhelmingly approved this new com- 
mittee. Revealingly, South Carolina and New York 
voted against the creation of a second committee. On 
July 10 this committee on House apportionment, on 
which Sherman served, proposed a 65-seat House (that 
is, a multiple of thirteen). Compared to the previous 
plan, smaller states gained seats relative to larger ones, 
and four states could no longer form a majority in the 
House. This allocation was approved and written into 
the final Constitution, despite repeated efforts to add 
a seat here or subtract a seat there. By making it im- 
possible to form a four-state majority, the new House 
apportionment reduced the potential political clout of 
New York and each of the Carolinas, compared to the 
preceding report. North Carolina’s Hugh Williamson 
responded with alarm: “The South[ern] interest must 
be extremely endangered by the present arrangement.” 
Two New York delegates left the Convention after 
July 10 and never returned. South Carolina’s dele- 
gates began to battle furiously to increase their lever- 
age by insisting on full representation of slaves in 
apportioning representatives, reducing the apportion- 
ment of seats to the northern states, and increas- 
ing the apportionment of seats to the south (RFC 
July 10, I, 566-68; July 11, I, 580). The Conven- 
tion’s dynamics had shifted against Madison irrever- 
sibly. 

Massachusetts, North Carolina, and Georgia aban- 
doned Madison’s coalition and joined with Con- 
necticut, New Jersey, and Delaware to defeat an 
eleventh-hour proposal to rescue proportional rep- 
resentation in the Senate. At the start of the next 
week, North Carolina joined Connecticut, New Jersey, 
Delaware, and Maryland to form a majority for the 
final vote on the Connecticut compromise. Because 
Massachusetts divided on this vote, Madison could 
only muster the votes of Virginia, Pennsylvania, South 
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Carolina, and Georgia against the bargain (RFC July 
14, D, 12; July 16, LI, 15). 


ALTERING MADISON’S SUBSTANTIVE 
PLANS FOR THE CONSTITUTION 


With Madison’s political strategy for the Convention 
now spoiled, large and southern states pursued inde- 
pendent strategies to defend their comparative advan- 
tage. Many southern delegates decisively aligned with 
Sherman’s position that national authority should be 
limited and enumerated. At the same time, Madison 
and his closest allies fought to assure that the presi- 
dent would have authority to pursue national interests 
against a Senate they now viewed as the agent of the 
states. 


Dividing Government Authority 


Prior to the Convention, Americans took for granted 
the principle that sovereignty could not be divided 
among governments in the same area. They accepted 
British legal scholar Sir William Blackstone’s con- 
tention (1765-69) that sovereignty must be lodged in 
a single “supreme, irresistible, absolute, uncontrolled 
authority” (49). Accordingly, Madison, Gouverneur 
Morris, James Wilson, Alexander Hamilton, and 
Rufus King insisted that national authority could not 
be shared with the states (RFC May 30, I, 43-44; June 
8, I, 167, 172; June 9, I, 180; June 19, I, 322-24). At first, 
the delegates raised few questions about broad national 
sovereignty, voting for Madison’s national veto of state 
laws. Only Sherman voted against this early endorse- 
ment of the national veto (RFC May 31, IL, 54). 

From the start, Sherman took the lead in making 
the innovative counterargument that the state and na- 
tional governments “should have separate and distinct 
jurisdictions” (RFC June 7, 1, 150). When Gouverneur 
Morris argued “that in all communities there must be 
one supreme power, and one only,” Sherman merely 
conceded “that the General & particular jurisdictions 
ought in no case to be concurrent” (RFC May 30, 
I, 34-35, 42). By linking this admission to the claim 
that the existing system should not be greatly changed, 
Sherman subtly established the framework for separate 
realms of policy authority for both the state and the na- 
tional governments. Later, he dexterously used core re- 
publican values to defend states’ rights, reminding the 
delegates that “each State like each individual had its 
peculiar habits usages and manners, which constituted 
its happiness. It would not therefore give to others a 
power over this happiness, any more than an individual 
would do, when he could avoid it” (RFC June 20, I, 341- 
43; see Ellsworth, RFC June 30, I, 492). Ellsworth’s 
succinct formulation, “We were partly national; partly 
federal,” irked Madison at the Convention, but it ac- 
curately expressed Connecticut’s aspiration for Con- 
stitutional design as well as the working rules of the 
existing political system (RFC June 29, I, 469; RFC 
July 14, I, 8-9). A vote to broaden the national veto 
was defeated soundly in early June, supported only 
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by the three large states and opposed by the rest of 
the south (RFC June 8, I, 164-68, 171-72). Madison’s 
plan for broad national authority was beginning to 
crack. 

As soon as the Convention adopted the bicam- 
eral compromise, several delegates from the South 
decisively shifted to become “state federalists” who 
supported explicit protections for the states’ policy 
prerogatives. Edmund Randolph and most of South 
Carolina’s delegates broke with Madison’s original 
plan and called for limited, enumerated national pow- 
ers (RFC July 16, II, 17). “The security the South[er]n 
States want,” said Pierce Butler of South Carolina, 
“is that their negroes may not be taken from them” 
(RFC July 13, I, 605; see General Pinkney, July 23, 
I, 95). South Carolina’s urge to protect its vital eco- 
nomic assets played into Connecticut’s hands. On 
July 17, Sherman asserted that the national veto of state 
laws approved in May was not necessary. Pennsylvania, 
South Carolina, and Georgia joined the four econom- 
ically disadvantaged states to remove the national 
veto from the plan (RFC July 17, O, 27-28). Nine 
days later, the Convention turned over its work to 
a five-member Committee of Detail, which included 
Oliver Ellsworth, John Rutledge, James Wilson, the 
conciliatory Nathaniel Gorham, and the back ped- 
aling Edmund Randolph. Ellsworth, Rutledge, and 
Randolph now constituted a committee majority for 
limited and enumerated national powers (Hueston 
1990). The Committee of Detail’s August 6 draft 
dropped the Virginia Plan’s broad grant of national au- 
thority and marked out a list of suitable national pow- 
ers and of powers forbidden to the states. It authorized 
the national government to make all laws “necessary 
and proper” for implementing these powers (Rakove 
1996, 178; RFC I, 167-68). It limited national author- 
ity to commerce “with foreign Nations & amongst the 
several States,” in effect excluding intrastate commerce 
from national control (RFC II, 142-43, 168-69, 183). 
The Committee thus underwrote the states’ power to 
manage economic affairs within their borders and nar- 
rowed the national authority to international and in- 
terstate relations. 

When Madison and his closest allies subsequently 
fought to expand national economic powers, Sherman 
and his allies defeated them almost entirely. To the 
very end of the Convention, Madison insisted that 
interstate and intrastate commerce were insepara- 
ble and should be regulated wholly by the national 
government (RFC September 15, II, 625). Sherman 
thought the distinction was largely self-evident and 
urged that the states should retain the authority to reg- 
ulate their internal commerce, with disputes resolved 
by the courts or Congress (RFC ILI, 615). Sherman’s 
position prevailed because the defeat of proportional 
representation caused southern delegates to defend 
their state policy prerogatives against a national gov- 
ernment whose policy direction seemed ever more 
uncertain. Southern delegates sought supermajority 
rules in Congress for commercial treaties, aiming to 
impair national treaties that could harm their export 
economies. While many northerners inside and outside 
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Madison’s original coalition resolutely opposed slav- 
ery, Connecticut played a pivotal role as a New England 
delegation that defended the South’s policy preroga- 
tives. Sherman argued that, “as the States were now 
possessed of the right to import slaves, as the public 
good did not require it to be taken from them, & as 
it was expedient to have as few objections as possible 
to the proposed scheme of Government, he thought it 
best to leave the matter as we find it” (RFC August 22, 
I, 369, 372-75). Yet another committee developed a 
grand compromise on these sticky issues of commercial 
authority. The committee’s “understanding” allowed 
the import of African slaves until 1800 while requir- 
ing only simple legislative majorities to enact commer- 
cial treaties (RFC August 24, II, 400). This compro- 
mise more clearly demarcated the spheres of national 
and state policy authority. The national government 
could provide commercial treaties required by national 
sovereignty, but it could not interfere with the way a 
state managed economic assets it considered vital, even 
when state policy protected a practice as reprehensible 
as slavery (RFC August 29, II, 449-53). 

Sherman also successfully fought proposals by 
Madison and others to authorize national canal con- 
struction, corporate charters, a national university, and 
militia regulation (RFC August 18, I, 326, 330-33; 
September 12, I, 587-88; September 14, II, 615-16). 
Sherman, though, showed no hesitation to use national 
law to limit negative externalities of policy made by 
other states. He supported an absolute national prohi- 
bition on states’ power to issue currency (RFC June 8, 
I, 165; August 28, I, 439). 


Aa ag Agency In Natlonal 
Policy-Making 


After July 16, the struggle to control the national 
policy-making process focused on specific choices 
about the powers and selection of Congress, the presi- 
dent, and the courts. Once the Convention had agreed 
to make the Senate the agent of the state governments, 
Madison and several allies took up the fight to make the 
president independent of Congress, or, as Madison put 
it, to give the president “free agency with regard to the 
Legislature” and to broaden presidential powers (RFC 
July 17, I, 34; July 19, I, 56). Confident that the most 
vulnerable states had a credible defense in the Senate, 
Sherman advanced proposals to make national policy- 
makers as dependent on the Senate as possible (RFC 
July 17, I, 29; July 18, II, 41-43; August 24, II, 405-6). 
The Committee of Detail’s August 6 draft was more 
consistent with Sherman’s preferences than Madison’s. 
State legislatures would directly choose and pay U.S. 
Senators and would determine voter qualifications, 
districts, and time for U.S. House elections. Congress 
would appoint the president, and the states’ national 
policy agent, the Senate, had sole power to appoint 
Supreme Court judges and ambassadors, make treaties, 
and settle disputes among states. Because the plan pro- 
vided that the House and Senate would vote separately 
to appoint the president, the Senate had the power to 


veto a presidential candidate favored by the House. 
Sherman relished that possibility (RFC August 6, 
Il, 177-89; August 7, I, 196-97). 

Confronting the prospect of a powerful Senate serv- 
ing as the agent of the state governments, Madison and 
his allies redoubled their efforts to separate the pres- 
ident from the Senate. When the Convention reached 
an impasse on presidential selection, Sherman again 
urged the delegates to create a committee to deal 
with such unfinished business. This Committee on Post- 
poned Matters consisted of several of the Convention’s 
prominent protagonists, including Madison, Sherman, 
Dickinson, and Gouverneur Morris. On September 4, 
the committee recommended in effect a grand compro- 
mise on the presidency. State legislatures would deter- 
mine how presidential electors would be chosen. The 
number of electors allocated to a state would equal the 
sum of its House and Senate seats, a formula equivalent 
to a joint vote of the House and Senate and one that 
modestly equalized the weight of the states in choosing 
the president. The Senate would appoint the president 
if all candidates fell short of a majority (RFC August 24, 
IIL, 403; August 31, II, 481; September 4, II, 497-500; 
OI, 132). The committee also compromised on the se- 
lection of other officers and judges by providing for 
presidential appointment with a Senate veto (RFC 
September 6, I, 523; September 7, U, 538-40). Finally, 
the committee provided that the president would have 
power to make treaties but would need the approval of 
a two-thirds majority of the Senate (Rakove 1996, 89- 
90; RFC September 4, I, 493-94, 498-99). Sherman, 
Rutledge, and others with judicial experience partic- 
ularly supported an independent policy role for the 
courts. Accordingly, the courts’ role expanded incre- 
mentally and with little opposition (Rakove 1997). 

When the Convention reconsidered the Constitu- 
tional amendment process in its final week, debate 
turned on whether that process posed a preater threat 
to the states or to the national government. Sherman 
proposed that Congress submit amendments to the 
states. The Convention approved this method, specify- 
ing ratification by three-quarters of the states, a provi- 
sion that became the most common method for amend- 
ing the Constitution. Madison then moved to include 
the other amendment alternatives, while Rutledge in- 
sisted that the slave trade be place beyond amendment 
until 1808. The Convention overwhelmingly approved 
both proposals (RFC September 10, I, 557-59). 

On its last full day of debate, Sherman sandbagged 
the Convention. Calling attention to the precedent set 
by Rutledge’s slave trade provision, he moved to add 
that “no State shall without its consent be affected in 
its internal police, or deprived of its equal suffrage 
in the Senate.” The Convention defeated Sherman’s 
last minute motion, with only Connecticut, New Jersey, 
and Delaware in favor. However, the “circulating mur- 
murs of the small states” prompted an immediate 
reassessment of the provision for Senate representa- 
tion. By a unanimous vote, equal state representation 
in the Senate became the only provision that could 
never be changed by Constitutional amendment (RFC 
September 15, I, 629-31). 
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THE INFLUENCE OF MADISON’S 
CONVENTION OPPONENTS 


James Madison played a principal role in bringing 
the U.S. Constitution into existence. Without his pre- 
Convention efforts to build an agenda, a support- 
ive coalition, and Convention legitimacy, it is hard 
to imagine how the delegates would have produced 
a document endorsed by political leaders from ev- 
ery state but Rhode Island. His Convention notes 
remain the most definitive record of its proceedings. 
His tactic for popularly elected state ratifying conven- 
tions (which succeeded despite Sherman’s opposition) 
and his vigorous advocacy of the Constitution in New 
York and Virginia probably were necessary for the 
Constitution’s adoption. Perhaps the most important 
proponent of religious freedom in the United States 
(Wills 2002), Madison shepherded the Bill of Rights 
through the first Congress. As a leader in the House 
of Representatives, President George Washington’s 
sometime ghostwriter, founder of the Democratic 
Republican party, Secretary of State, President, and 
esteemed elder statesman, Madison subsequently ex- 
ercised a singular influence in Constitutional develop- 
ment. 

Bringing the Constitution into existence was very dif- 
ferent from designing its provisions, however. Madison 
failed to persuade the delegates to include the three 
most significant features of his spring, 1787 plan for 
reconstituting government. First, he sought very broad 
national authority to regulate commerce, levy taxes, 
guide economic development, and veto state laws. In- 
stead, the Convention produced a national govern- 
ment with comparatively narrow, specified authority. 
The original Constitution underwrote the state gov- 
ernments’ authority to govern routine activities of most 
Americans, and it made national preemption of state 
authority much more difficult than Madison hoped. 
Second, Madison sought to eliminate state agency in 
national policy-making entirely. Instead, the Conven- 
tion thrust into the middle of the national policy pro- 
cess an assembly of state agents, a Senate that would 
represent each state government on an equal basis. The 
Constitution gave to this Senate the special privileges 
of consenting to treaties, approving presidential ap- 
pointees, and potentially convicting the president of 
impeachable offenses. Third, Madison sought to es- 
tablish relative population or wealth as the basis for 
apportioning seats in both the House and Senate, and 
he needed to achieve this goal very soon after the start 
of Convention deliberations. Instead, the Convention 
compelled him to accept an unwanted compromise on 
this central goal. Roger Sherman, John Dickinson, and 
other delegates from the economically disadvantaged 
states defeated Madison’s aspiration for a relatively 
brief meeting driven by a swift agreement on propor- 
tional representation in both houses of Congress. In- 
stead, they changed the Convention’s dynamics, forcing 
a three and a half—-month political struggle largely de- 
fined by Madison’s aspirations for the nation’s political 
future and Sherman’s defense of the nation’s political 
present. 
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Endorsements by Roger Sherman and his Connecti- 
cut colleagues legitimized several other Constitutional 
provisions, thus facilitating their inclusion: a bicam- 
eral legislature, an independently selected president 
checked by the Senate, temporary tolerance of the 
slave trade, and an independent judiciary with the 
power of judicial review of state laws. Sherman also 
legitimized a host of less important provisions, such as 
the designation of the Vice President as President of the 
Senate and of the House of Representatives as the final 
arbiter of presidential selection (RFC September 6, 
II, 527; September 7, I, 537). 

Evidently, the states outside of Madison’s coalition 
considered the Constitution a success for their inter- 
ests. They had reduced the two main threats posed 
by the Virginia Plan by retaining control over their 
internal policy and by making it hard to use the na- 
tional government to harm their constituents and their 
economic assets. They achieved their three main ob- 
jectives. The Constitution nationalized specific public 
goods, established strong defenses for their existing 
advantages, and allowed for the nationalization of fur- 
ther public goods only if proponents could meet the 
high transaction costs required for making public policy 
in the new national system. Soon after the signing of 
the Constitution, Roger Sherman and Oliver Ellsworth 
declared victory in a letter to their governor. The Con- 
vention “endeavoured to provide for the energy of gov- 
ernment on the one hand, and suitable checks on the 
other hand, to secure the rights of the particular states, 
and the liberties and properties of the citizens.” Just 
as in the Confederation Congress, Connecticut would 
continue to enjoy a one-thirteenth share in selecting 
important national policy-makers: Connecticut would 
fill one-thirteenth of the seats in the new U.S. House of 
Representatives and Senate and cast one-thirteenth of 
the votes in the Electoral College. Connecticut’s dele- 
gates had secured a satisfactory distribution of national 
and state public goods. “Some additional powers are 
vested in congress, which was a principal object that the 
states had in view in appointing the convention. Those 
powers extend only to matters respecting the common 
interests of the union, and are specially defined, so that 
the particular states retain their sovereignty in all other 
matters” (RFC III, 99-100). Madison would not have 
been able to disagree with this assessment. 


MADISON’S CONVENTION OPPONENTS 
AND AMERICAN POLITICS 


The original Constitution’s design is not the product of 
a systematic philosophical plan, but the by-product of 
a path-dependent sequence of political compromises 
largely forced on Madison and his allies by their Con- 
vention opponents. No one in May 1787 anticipated 
the final product. No delegate told the Convention 
that the result was philosophically superior to any 
other theoretical plan. Instead of fully solving their 
central political dilemma—getting better national pub- 
lic policy from a republican national government that 
would not itself threaten vital interests—the framers 
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narrowed the scope of the problem, allocating some 
authority clearly while delegating to the political pro- 
cess the divisive issues that remained too intractable in 
the summer of 1787. Constitutional rules about policy 
agency and policy-making set constraints on this po- 
litical process. For 39 signers, the document met the 
criteria for republican government and seemed likely 
to produce better policy outcomes than the status quo. 
Subsequently, constructing the Constitution essentially 
has been a political task (Whittington 1999). 

The political accommodations on which Madison’s 
Convention opponents insisted have had an enduring 
impact on American political development. Federal- 
ism illustrates this impact as well as any Constitutional 
feature. American federalism was not designed sim- 
ply to check national power, because too many of the 
Convention delegates wanted the national government 
to have some very effective powers to provide for 
currency regulation, commercial treaties, a national 
revenue stream, national defense, and other nation- 
alized public goods beneficial to their constituents and 
the nation. While Madison battled for extensive na- 
tional sovereignty, Roger Sherman advanced the idea 
of dual sovereignty as a justification for nationalizing 
some public goods while protecting the states’ remain- 
ing autonomy and political prerogatives. Federalism in 
the Constitution emerged from a long series of inter- 
dependent, issue-by-issue compromises, guided by the 
ongoing clash between the Madison and the Sherman 
positions on which level of government would control 
which tools of public policy and how each level would 
use those tools to govern politics. Sherman and his 
allies got a limited nationalization of public goods, a 
list of enumerated national powers, state control of 
residual policy authority, and some defensive tools that 
states could use to fight the aggregation of power by 
the national government. Despite Madison’s efforts, 
the national government would manage only interna- 
tional and interstate commerce, while the states would 
govern commerce within their borders. Madison and 
his allies got a sort of ersatz national veto vested in the 
federal courts through the supremacy clause and some 
Constitutional language that could be used to press 
for the expansion of national power on a case-by-case 
basis. 

The delegates artfully blurred many politically por- 
tentous terms and boundaries between state and na- 
tional authority and then handed the ensuing boundary 
issues over to the politicians who would contest them 
according to the newly established policy-making rules. 
They neither defined this boundary clearly nor pro- 
vided definitive guidelines for resolving these future 
conflicts (see also Rakove 1996, 201). The delegates 
never specified precisely what kinds of public goods 
would be provided by the states and which by the na- 
tional government. They did not specify a list of the 
civil rights and liberties to which Americans would be 
entitled; instead, Madison put forward a bill of rights 
in the first Congress after promising his constituents he 
would do so. Sherman’s own list of essential national 
powers was glaringly vague (RFC June 6, I, 133), and 
the delegates collectively were at a loss to define key 


warrants of national authority such as “direct taxes” 
(RFC August 20, I, 350). The Committee of Detail’s 
insertion of the “necessary and proper” clause strongly 
suggests that even Connecticut’s delegates were willing 
to permit the expansion of national power to cover 
hard cases and unexpected contingencies. The dele- 
gates’ explicit rejection of Sherman’s proposal to place 
state police powers beyond Constitutional amendment 
demonstrates their desire to leave the range of national 
authority fungible. National power could be expanded 
but could not be expanded at will. They could not 
agree on precisely how elastic national power should 
be, but they could agree that by raising institutional de- 
fenses to lower the probability of outcomes they feared, 
they could reduce the threat that the national gov- 
ernment would take advantage of their constituents. 
States would influence national policy outcomes, and 
their agents could delay, defeat, or extract concessions 
for the expansion and use of national powers (see also 
Wechsler 1955). 

Political opponents have been using federalism as 
a political weapon ever since the first meeting of the 
reconstituted Congress in 1789. The most wrenching 
conflicts in American political development repeat- 
edly have been displaced by battles over the defini- 
tion of the dividing line between state and national 
authority. Competing claims of states’ rights and na- 
tional responsibility have structured the most substan- 
tive political conflicts of American political develop- 
ment, including slavery, the regulation of business and 
investment, civil and criminal rights, religion, welfare, 
health, education, the environment, alcohol, and a host 
of other issues. The ambiguous boundary between in- 
terstate and intrastate commerce became a flashpoint 
for many significant struggles over the control of the na- 
tion’s economic development. In the political struggle 
between employers and workers since the Civil War, 
political opponents have used federalism as a weapon 
to allocate policy authority over the hours Americans 
work, the wages they are paid, the conditions of their 
workplaces, the age at which they begin worklife, and 
the terms on which they can organize trade unions to 
advance their interests. The Constitution’s ambiguous 
allocation of policy authority contributed to an Amer- 
ican trade union strategy that left the United States, 
uniquely, without a distinct social democratic or labor 
political party (Robertson 2000). 

Political interests still utilize the Constitution’s un- 
settled political frontier of federal authority as an ex- 
pedient tool for achieving substantive policy outcomes. 
Often, “states rights” and decentralization are asso- 
ciated with conservative policy agendas, such as that 
of Ronald Reagan. Liberals, however, regularly use 
states’ rights arguments when they help achieve the 
policy outcomes they advocate, just as conservatives 
regularly invoke national authority to engineer the out- 
comes they prefer. In the 1990s, liberal state attorneys 
general used state policy authority to extend consumer 
protection, prosecute tobacco companies, and chal- 
lenge Microsoft’s market dominance. Democrats on 
the Federal Communications Commission supported 
continued state regulation of telecommunications 
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services in 2003, while Republican chair Michael Pow- 
ell insisted on national telecommunications rules per- 
mitting more latitude to economic actors. When the 
gay marriage issue arose early in the 2004 presidential 
campaign, Democratic candidate Howard Dean sought 
to deflect the issue from the national campaign by ar- 
guing that the states should exercise authority over the 
issue. Three days after Missouri voters overwhelmingly 
approved an amendment to the state constitution ban- 
ning gay marriage, Democratic Vice-Presidential nom- 
inee John Edwards made a similar argument for states’ 
rights (Mannies 2004). Though liberal Democrat Al 
Sharpton and conservative Republican George Bush 
had completely different agendas and constituencies, 
both candidates argued that the gay marriage issue 
should be nationalized rather than left to state dis- 
cretion. 

Dual federal and state sovereignty, then, funda- 
mentally is a political invention, devised by skilled 
politicians with deliberate and expedient imprecision. 
Searching for the definitive original intent for the spe- 
cific division of powers between the nation and the 
states is a fool’s errand. 


CONCLUSION 


The Convention delegates who opposed James 
Madison’s Virginia Plan have had a far greater impact 
on American politics than most Americans appreciate. 
Opponents of the Virginia Plan aimed to add a few 
carefully chosen powers and offices to the national 
government while, at the same time, protecting most 
of the states’ policy prerogatives and ensuring that the 
state povernments would influence the way the national 
government used its powers. Undeniably, Madison’s 
opponents achieved much of what they sought. Led 
by Roger Sherman and the Connecticut delegation, 
they spoiled Madison’s strategy for the Convention. 
They altered key features of his Constitutional plan 
and engineered substitute provisions better suited to 
their states’ political interests. Because of Sherman and 
his Convention allies, the Constitution established a 
national government with less authority and less inde- 
pendence of the states than Madison initially wanted. 
Madison’s Convention opponents, then, made the ba- 
sic, enduring rules of American politics much more pro- 
tective of the states, and more resistant to geographical 
redistribution, than the rules Madison set out to put in 
place. 

Because American politicians have had to play by 
these basic rules since 1789, Sherman and his allies 
have had a cumulative and lasting effect on the way 
American politics has evolved. Madison’s Convention 
opponents helped produce government that is more 
complicated and harder to use than any delegate ex- 
pected (Robertson 2005). Their influence endures in 
Constitutional provisions that protect the states’ policy 
authority, that safeguard equal state influence in na- 
tional policy-making through the Senate, and that pro- 
vide for a state-based system for selecting the president. 
If Madison had had his way, for example, the national 
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government would have enjoyed unambiguous author- 
ity to control intrastate commerce. If the national gov- 
ernment exercised this uncontested commercial au- 
thority from the beginning, America’s most explosive 
policy conflicts, including slavery, trade unions, and 
civil rights, would have played out differently, changing 
Americans’ political inheritance in fundamental ways. 

Yet if either Madison or his main opponents had 
walked away from the Constitutional Convention, it 
is difficult to imagine how the meeting would have 
produced a comparable political success. The polit- 
ical synergy between Madison and Sherman, then, 
very well may have been necessary for the Consti- 
tution’s adoption. Ironically, his opponents’ achieve- 
ments also became indispensable for the success of 
Madison’s own subsequent political career. During 
the Convention, Madison concentrated on strength- 
ening political institutions to pursue national inter- 
ests and ignored the possibility that national interest 
could be pursued in ways antagonistic to his own vi- 
sion for the nation’s future. But their inherently in- 
compatible visions of national interest opened a polit- 
ical chasm between U.S. Representative Madison and 
Treasury Secretary Alexander Hamilton in the early 
1790s. During the Convention, Sherman concentrated 
on defending the political economies of the states and 
ignored the difficulties of drawing a clear distinction 
between state and national authority. It was Madison, 
not Sherman, who was among the first American politi- 
cians to seize on this distinction. Madison, defend- 
ing “state’s rights,” used Constitutional guarantees of 
state policy authority to build a national coalition of 
diverse interests opposed to Hamilton’s program for 
national economic direction. Madison’s political career 
after 1787, like America’s political development, owes 
a considerable—if unappreciated—debt to his oppo- 
nents’ influence on the Constitution’s design. 
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Flexing Muscle: Corporate Political Expenditures as Signals 


to the Bureaucracy 


SANFORD C. GORDON and CATHERINE HAFER New York University 


egulatory agencies impose costs and benefits tailored to individual firms through their discre- 

tionary enforcement activities. We propose that corporations use political expenditures in part 

to “flex their muscles” to regulators and convey their willingness to fight an agency's specific 
determinations in the political arena. Because the signaling function of political expenditures is strate- 
gically complex, we derive a formal model wherein we demonstrate the existence of an equilibrium in 
which (1) large political donors are less compliant than smaller ones, but the bureaucracy monitors them 
less, and (2) firms with publicly observable problems reduce their political expenditures. We test the 
empirical implications of the model using plant-level data from the Nuclear Regulatory Commission on 
the inspection of 63 privately operated nuclear power plants and the political expenditures of their parent 
companies. We find strong evidence for the first prediction and qualified support for the second. 


hen corporations and their employees spend 
WW zoer in the political arena, to what ex- 
tent are they purchasing special treatment? 
Complaints about the deleterious effects of corporate 
money in politics are ubiquitous in American politi- 
cal discourse. However, in order to pinpoint the place 
of these expenditures in American democracy, and to 
appreciate fully the consequences of various reform 
proposals, itis critical to identify the precise mechanism 
through which any preferential treatment might occur. 
In this spirit, scholars have suggested that corpora- 
tions make political expenditures because they hope 
to influence government behavior, through either ex- 
plicit quid pro quos with legislators for constituency 
service or favorable votes (Baron 1989; Grossman and 
Helpman 1994, 2001), improved access to legislators 
(Hall and Wayman 1990; Langbein 1986), or enhance- 
ment of the electoral prospects of sympathetic incum- 
bents (Poole and Romer 1985). Empirical evidence ver- 
ifying a definitive relationship between expenditures 
and specific policy change, unfortunately, has proven 
elusive. In a recent review essay, Ansolabehere, de 
Figueiredo, and Snyder (2003, 116) conclude, “Con- 
tributions explain a miniscule fraction of the variation 
in voting behavior in the U.S. Congress.” They suggest 
that contributions are better thought of as consumption 
goods that fulfill a desire to participate in politics than 
as investment in a political marketplace. 
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At the same time, recent research has suggested that 
campaign expenditures may be the tip of the iceberg. 
Wright (1990) and Ansolabehere, Snyder, and Tripathi 
(2002) note that lobbying, rather than campaign con- 
tributions by political action committees (PACs), con- 
stitutes the bulk of interest group political expendi- 
tures. Because data on lobbying expenditures have 
been harder to come by, we know little about their 
effect on policy. Still, we can recast the debate over 
campaign contributions as one about political expendi- 
tures more broadly: If corporate political expenditures 
merely reflect the expressive desires of a few execu- 
tives, we needn’t be too worried that those with deep 
pockets get their way in politics more often than oth- 
ers. If, on the other hand, such expenditures are an in- 
vestment whose return political scientists have simply 
failed to document, there is greater cause for concern 
that access to the levers of power exists only for a select 
few. 

We propose that political expenditures serve as a 
visible means by which corporations flex their muscles 
to potential adversaries, in particular, the bureaucracies 
charged with regulating them. Regulatory agencies can 
impose costs and benefits tailored to individual compa- 
nies through the discretionary enforcement of statutes 
and rules. If a company can communicate its intention 
to fight agency decisions through subsequent action in 
the political arena, regulators will simply choose either 
to regulate less or to regulate elsewhere. 

Accordingly, we should expect reduced regulatory 
oversight of companies with larger political expendi- 
tures. However, it could be that large expenditures 
would implicate a company in noncompliance and spur 
greater oversight. Companies would then want to re- 
duce political spending to avoid being targeted. To 
better understand the specific conditions under which 
political expenditures might play the role we suggest, 
we present a model of costly signaling by a firm to a 
regulator. We use this model to demonstrate the ex- 
istence of an equilibrium wherein larger expenditures 
do induce the agency to reduce its regulation. We then 
test the model’s predictions using data on the inspec- 
tion activities of the Nuclear Regulatory Commission 


(NRC). 
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The key implication of our theory is that firms can 
extract policy concessions from regulators through po- 
litical expenditures to elected officials, even holding 
legislative behavior constant. This does not mean that 
corporations and their executives never attempt to buy 
legislative access or special treatment outright. It does 
imply that a firm may never need to fight an agency 
politically if it can credibly signal its willingness to 
do so. This insight helps to explain two puzzles con- 
cerning the role of money in politics. First, corporate 
political expenditures seem small relative to the ben- 
efits of legislative action because signaling a willing- 
ness to fight is always less costly than the fight itself 
Second, political scientists have often had difficulty 
documenting a systematic relationship between po- 
litical expenditures and policy outcomes empirically 
because, in essence, we’ve been looking in the wrong 
place. 

The paper also seeks to advance our understanding 
of regulatory politics and the relationship between bu- 
reaucracies and interest groups. Although in our model 
some firms achieve forbearance, regulators are not 
“captured” in the sense of an aging bureaucracy allying 
with or adopting the preferences of an industry it was 
originally intended to regulate (e.g, Bernstein 1955; 
Huntington 1952) or accepting side-payments from in- 
dustry (e.g. opportunities for future employment) in 
return for lax regulation (Gormley 1979; Laffont and 
Tirole 1991). Rather, the regulator simply reacts to the 
realities of its limited abilities and pursues the rewards 
of regulation in other places. 


SELECTIVE REGULATION AND POLITICAL 
EXPENDITURES 


Regulation is the area of public policy most frequently 
identified with efforts by corporations to influence gov- 
ernment officials’ behavior. According to one perspec- 
tive, producers offer political support to legislators in 
return for regulation if it reduces market uncertainties 
or imposes price, quantity, or entry restrictions to the 
benefit of existing firms (Grossman and Helpman 1994; 
Kolko 1965; Stigler 1971). Often, however, regulatory 
policy is characterized by an agency that compels firms 
to internalize the cost of some production externality. 
Inasmuch as firms wish to minimize such costs, they 
will enter the political arena in pursuit of regulatory 
forbearance (Honohan and Klingebiel 2000): the re- 
duction of regulatory burdens. 

Through what mechanisms might corporations plau- 
sibly influence government regulatory action through 
political expenditures? First, campaign contributions 
may secure the incumbency of allies (Endersby and 
Munger 1992; Poole, Romer, and Rosenthal 1987). 
For example, FirstEnergy Corporation, an Ohio- 
based utility, contributes frequently to Senator George 
Voinovich, a strong advocate of streamlining the pro- 
cess for building new nuclear plants. Second, expen- 
ditures may be service-oriented (Baron 1989; Snyder 
1990). In 1990, five senators were accused of hav- 
ing pressured the Federal Home Loan Bank Board 
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into diminishing its oversight of the Lincoln Sav- 

ings and Loan Association. Charles Keating, the chair 

of Lincoln’s parent company, had coordinated cam- 
aign contributions to the senators totaling more than 
1.3 million. 

A third mechanism is more subtle. As the Justice 
Department’s investigation and subsequent antitrust 
suit against Microsoft progressed in the late 1990s, the 
company drastically increased its lobbying and cam- 
paign expenditures. The Washington Post reported, 
“The company’s goal, according to people familiar 
with its strategy, is to discourage government lawyers 
from seeking aggressive sanctions in their antitrust law- 
suit” (Chandrasekaran 1999). Microsoft officials sug- 
gested that by lobbying Congress to cut the Antitrust 
Division’s budget in October 1999—too late to have 
any effect on their case—they were sending a message 
to the Division about their displeasure with its conduct 
(Morgan and Eilperin 1999). The American Antitrust 
Institute, a Washington nonprofit, called the action a 
negotiating ploy, noting, “Microsoft’s salvo is clearly an 
attempt to leverage its position through intimidation” 
(Lawsky 1999). 

During the lawsuit, lawmakers came to Microsoft’s 
defense in floor speeches railing against the bureau- 
cracy and Thomas Penfield Jackson, the presiding judge 
on the case. But what is striking about the episode 
is that company officials were hoping the expendi- 
tures themselves, rather than direct action by any leg- 
islators, would have the desired effect on the Justice 
Department’s strategy. By spending in the political 
arena, in other words, Microsoft was flexing its political 
muscles. 

Regulation is costly to firms, but it can also be 
made costly to regulators. We propose that corpora- 
tions sometimes make political expenditures to signal 
credibly their willingness to take actions that would im- 
pose costs on an agency in response to its enforcement 
efforts. In doing so, a firm can affect an agency’s discre- 
tionary enforcement behavior, and hence the expected 
costs to the firm of regulation. To work, such a causal 
mechanism requires that firms be able to impose costs 
on agencies, that the willingness to impose such costs 
differs across firms, and that these differences be com- 
municable via the signal. The last of these requirements 
is the focus of the next section. We describe the first 
two in greater detail here. 

There are many ways a firm can impose costs on 
an agency in response to the agency’s enforcement ac- 
tions. Perhaps the simplest is to contest the agency’s 
findings, either informally, as with company officials 
hashing out the appropriate response to a safety vi- 
olation with compliance officers, or formally, through 
an appeal process within the agency or in court. To 
the extent that an agency must dedicate resources to 
responding to such appeals, and to the extent that such 
appeals are successful, an agency is worse off when a 
firm contests actions than when it does not. 

A firm may also appeal to Congress for relief, 
imposing costs on an agency indirectly in the pro- 
cess. Congress determines the size of agency budgets 
(Fenno 1966), which may or may not be devoted to 
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enforcement.! Individual legislators can demand time- 
consuming and resource-intensive oversight hearings 
that agency officials prefer to avoid (Aberbach 1990). 
The legislature can also alter the substantive content 
of an agency’s enabling statute or shift the agency’s 
burden of proof to make a firm’ s appeal of enforce- 
ment actions in court easier.” Each of these possible 
responses to an appeal from a firm makes the agency 
worse off. Ceteris paribus, the more vigorously the firm 
contests the agency’s actions, the lower the net benefit 
to the agency of documenting infractions at that firm 
in the first place. 

Of course, the means with which a firm can impose 
costs on an agency entail costs for the firm as well. It 
is thus not necessarily the case that a firm will want to 
undertake the effort. The potential benefit to a firm of 
contesting an agency’s decisions is the suspension or 
overturning of some of the agency’s decisions against 
it, so that the firm is not required to correct as many 
infractions or pay as many fines as it otherwise would 
have been. Because the costs of correcting a given in- 
fraction differ across firms, firms differ in the net ben- 
efits that they obtain from appealing. Ceteris paribus, 
firms with higher costs of compliance find appealing 
an agency’s decisions against them—-whether in the 
agency, the court, or the legislature—more attractive, 
and can be expected to contest the agency’s decisions 
more vigorously. 

These two features—that the agency’s net benefit 
from documenting violations at a firm declines with 
the vigor of the appeal and that firms with higher costs 
of compliance are more willing to appeal—imply that 
the agency’s net benefit from discovering violations at 
a firm will decline with the firm’s compliance costs. 
If an agency could determine them, it would prefer 
regulating firms with higher compliance costs less strin- 
gently. Firms with different compliance costs will also 
differ in their valuation of regulatory leniency: Those 
with higher compliance costs value it more and, hence, 
would be more willing to bear costs ex ante in order 
to obtain it. It is therefore possible for a costly signal 
by a firm to communicate its compliance costs to the 
agency. 

While any costly expenditure that a firm could under- 
take and that would be observed by the agency could, 
in principle, be used to demonstrate its willingness to 
contest the agency’s decisions, political expenditures 
are especially well suited for this role. To see why, 
consider the following analogy: Lawsuits are costly, so 
litigants typically reach a settlement before going to 
trial. In order to improve his or her bargaining position, 
each party to the suit would like to be able to signal 
credibly to the other his or her willingness to spend a 
significant sum on a trial if a favorable settlement were 
not reached. While any costly action (i.e., “burning 


1 For a discussion of different constraints on the budget as an effec- 
tive weapon for legislators to wield (whether on behalf of or agninst 
corporate interests), see Quirk 1981 and Ting 2001. 

2 Of course, the legislature 1s not unique ın its ability to manipulate 
agencles; through their appointment power and statutory interpre- 
tation, presidents also play a critical role (see Moe 1985, 1987). 
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money”) could indicate his or her willingness to go 
to trial, retaining an army of high-priced trial attorneys 
would be the most unambiguous gesture of a party’s 
resolve in the contest. 

Similarly, a regulated firm could undertake a variety 
of costly expenditures, such as making charitable con- 
tributions or running public service announcements. 
However, political expenditures—whether in the form 
of legislative lobbying or employee campaign contri- 
butions coordinated through corporate PACs—most 
clearly signal the willingness of regulated parties to 
contest agency decisions, in the political arena if nec- 
essary.’ This intuition helps us to understand why cor- 
porate political expenditures appear low as a fraction 
of corporate revenues and assets. Just as the act of 
retaining counsel is cheaper than litigating, signaling 
a willingness to fight politically is necessarily cheaper 
than the actual fighting. 

Note that this mechanism need not imply that politi- 
cal expenditures never play the roles we normally asso- 
ciate with them. Lobbying still facilitates the reception 
of interest group messages. Corporate executives may 
still employ PAC contributions to secure the incum- 
bency of legislative allies, buy access, or secure favor- 
able legislation. Our theory does suggest that if polit- 
ical expenditures can be used to extract concessions 
from an agency, then firms have less reason to pursue 
these more drastic measures, making blatant quid pro 
quos rare. Indeed, such a mechanism would be valu- 
able to the legislature itself precisely because it gen- 
erates benefits for the members without their having 
to take any action (let alone an unseemly one) at all. 

Up until this point, we have been deliberately vague 
in articulating precisely which political expenditures 
send the clearest signal. As noted above, campaign 
contributions to legislative candidates constitute only 
a fraction of corporate political expenditures. A more 
elaborate story (beyond the scope of this paper) would 
be needed to explain the choice between corporate 
PAC expenditures and lobbying (although, critically, 
PAC funds come from employee contributions and not 
directly from the company treasury). To keep the ex- 
position as simple as possible, we assume that firms 
allocate political expenditures optimally given their 
particular goals, noting that different types of politi- 
cal expenditures made by individual firms are highly 
correlated (Ansolabehere, Snyder, and Tripathi 2002; 
Wright 1989). 


3 In our theory, costly mgnaling by a firm follows immediately from 


its intention to fight. An alternatrve approach would conceive of 
the signal as an evolved convention, much hke a dog unconsciously 
baring its fangs when angry or threatened. We thank an anonymous 
reviewer for suggesting this alternatrve. 

4 Note that the theory 1 also analytically distinct from the famous 
“iron triangle” account (Freeman 1965): (1) Multiple firms exist 
within the same industry and are treated asymmetrically; (2) the 
agency does not provide benefits to firms—all else equal, firms would 
prefer the agency not exist; and (3) relevant legislators do not actively 
protect the agency 
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MODEL 


Our model is one of strategic costly signaling between 
the firm and the agency. We are not the first to sug- 
gest that political expenditures play the role of signals 
to policy makers. Lohmann (1995) considers a model 
in which lobbyists with extreme preferences must pay 
a premium (a contribution) to credibly signal private 
information. Ball (1995) shows how campaign contri- 
butions can signal an interest group’s concern about a 
particular issue. Likewise, we are not alone in suggest- 
ing that firms enter the political market in order to re- 
duce the costs imposed by regulators. In discussing the 
services that a campaign contribution might buy, Baron 
(1989) points specifically to legislative intervention 
in the bureaucracy on behalf of the contributor, and 
Jobnston (2002) shows that purchases of such interven- 
tion may sometimes have informational spillovers. An 
important distinguishing feature of our theory, how- 
ever, is that firms (or their employees) undertake polit- 
ical expenditures in expectation of benefits that entail 
no action on the part of elected officials. 


Basic Structure 


The sequence of the game is as follows. First, Nature 
chooses the firm’s type t, 0 < t < T < Th, which con- 
trols the cost of complying with regulations and which 
is revealed only to the firm. Nature then chooses 
whether or not to sound a publicly observable fire 
alarm (cf. McCubbins and Schwartz 1984) correspond- 
ing to a compliance problem with the firm.’ Next, 
the firm chooses its political contribution c > 0 and, 
simultaneously, its level of regulatory compliance k, 
0 < k< 1. Finally, the agency observes the firm’s con- 
tribution and then chooses the level m at which it will 
monitor the firm. If an alarm has not sounded, the 
agency has complete discretion in choosing the level of 
monitoring, 0 < m < 1.8 If it has sounded, the agency 
must perform a mandatory inspection corresponding 
to a level of monitoring 7A, but it may monitor the firm 
more than that, if it so chooses, so that M < m < 1. Fol- 
lowing these choices, the agency observes a proportion 
of the firm’s violations corresponding to the level of 
monitoring of the firm, e.g., if the agency monitors the 
firm at a level m = 4, then it observes a third of the 
firm’s violations. The firm must correct all observed in- 
fractions and pay fines that are imposed according to an 
exopenously determined schedule (discussed below). 


5 We thank Matt Stephenson for bringing the latter to our attention. 
6 Such costs may be either tangible, reflecting technological or insti- 
tutional variation between firms, or intangible, reflectmg the pres- 
ence or absence of a “sense of duty” to comply (Scholz and Pinney 
1995). 

7 Because we are interested ın the firm’s choice of contribution and 
the agency’s response to it, we concern ourselves only with alarms 
that sound before the contnbution 1s made. Although ıt 1s empirically 
true that an alarm may also sound after the contnbution has been 
made, ıt cannot affect the contribution. In order to express the salient 
features of the model as simply as possible, we dispense with the latter 


teeta 
Making inspection intensity strictly positive for all firms would not 
affect our results. 
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The firm must bear as costs its contribution, fines 
for documented infractions, and the costs associated 
with compliance, including the costs of correcting doc- 
umented infractions. Let represent the magnitude of 
the violation that the firm would produce if it made 
no effort to comply with regulations, i.e., if A= 0, and 
assume it is common knowledge between the firm and 
the agency. This variable captures commonly known 
features of the firm (eg., size) that contribute to its 
innate propensity to generate violations. Let f (k) rep- 
resent the schedule of average fines imposed on the 
firm per unit of detected infractions. Because fines are 
a punitive measure, the fine per unit is increasing in the 
level of noncompliance so that, controlling for the in- 
nate propensity of the firm to commit violations, worse 
violators are fined more punitively.” Fines are levied 
only for documented infractions, which are equal to 
m(1 —k)w. The total fine assessed against the firm is 
the product of the per unit fine, which is determined by 
its level of compliance, and its documented infractions. 
Firms with lower levels of compliance face higher to- 
tal fines both because they must pay higher rates and 
because they must pay for more violations. 

The firm’s type, known only to it, is its cost per unit 
of reducing violations. The total cost of compliance for 
the firm includes the cost of rectifying the violations 
that the agency observes as well as the cost of its initial 
chosen compliance level. A firm of type t and size w that 
complies with regulations at level k and is monitored 
at level m bears a cost of compliance t(k+ m(1 — k) Jow. 

The agency benefits from the rewards associated with 
documenting infractions and from alternative expen- 
ditures on, inter alia, administrative costs, overhead, 
educational expenditures, and, importantly, monitor- 
ing other firms (here unmodeled).'" The agency faces a 
budget constraint b. Since compliance must be verified 
for the entire firm, the amount of monitoring depends 
both on its known violation propensity and on the thor- 
oughness of the inspection. The average per unit cost of 
monitoring is a(m). We assume that, controlling for the 
size of the firm, a thorough inspection is more costly 
to perform than a superficial one, so that the average 
per unit cost borne by the agency is increasing in the 
level of monitoring. The total cost of monitoring is then 
given by a(m)mo. 

From the budget constraint, the funds remaining 
available for alternative expenditures is b — a(m)mw. 
We assume that b > a(1)m, so that the agency could 
monitor the firm fully if it chose to do so. The agency’s 
reward per unit of documented violations is r(t). Be- 
cause, as we discussed above, a firm with higher costs 
of compliance is more willing, ceteris paribus, to con- 
test the agency’s findings ex post, the agency’s per unit 
reward is assumed to be decreasing in the firm’s compli- 
ance costs. This assumption plays an important role in 


9 This assumption corresponds to the NRC's enforcement policy; see 
US. Nuclear Regulatory Commission 2000 Fines can be a function 
of compliance because the agency correctly infers it once it inspects 
10 Ryplicitly modeling the agency’s allocation across firms and the 
potential resultant competition among firms 1s beyond the scope of 
the current analysis. 
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supporting the existence of a separating equilibrium. 
Note that although the agency’s reward per unit of 
documented violation is lower for a firm with a higher 
cost of compliance, the total reward to the agency for 
a given level of monitoring may be higher or lower, 
depending on how much the higher-cost firm reduces 
its level of compliance. These possibilities entail very 
different incentives for both the agency and the firm, a 
point that is examined more carefully below. 

To minimize mathematical complexity, we assume 
linear functional forms for f (k) and a(m). Because 
these functions represent per unit fines and costs, they 
correspond to increasing marginal fines for noncomp- 
liance and increasing marginal costs of inspection. The 
agency’s indirect utility function, u4, may be written 


u“(m, k, t) = b—a(m)mo + r(t)m(1 — Kho 


and is concave with respect to m, the agency’s choice 
variable. The firm’s indirect utility function, u”, is 


u™(m, k, c, t)=—c—f (k)m(1 — Ko —1(k+m(1—1)o 


and is concave with respect to compliance k.t 

The agency’s prior beliefs about the firm’s costs of 
compliance are described by the cumulative probabil- 
ity distribution P(r), such that P(t) is continuous and 
strictly increasing in t from tz to ta. The associated 
probability density is p(t). Denote the agency’s expec- 
tations about the firm’s strategy, which are correct in 
equilibrium, as k*(r) and c*(t). The agency’s expected 
utility, given the information available when it chooses, 
is 


E[u^ (m, k, 1) | P(t)] = b — a(m)ma 


+mo | p(tle)r(t)(1— KO) at. 


Equllibrium 


We derive the separating sequential equilibrium in Ap- 
pendix A and discuss it informally here. A separating 
equilibrium is one in which different types of firms 
choose different publicly observable contributions. As 
in all continuous signaling games, a pooling equilib- 
rium, in which different types choose the same actions, 
also exists. Because such an equilibrium would corre- 
spond to identical contributions for all firm types and 
a constant level of monitoring behavior for the agency, 
it would fail to capture the causal mechanism we the- 
orize. Hence we limit our attention to the separating 


11 Although our specification of the model rs deliberately kept sample 
to avoid algebraic complications, its results extend relatively easily to 
more general functional forms. The assumed linearity of the fines and 
of the agency’s costs of inspection does not drive the main substan- 
trve results, nor does it interfere with the derivation of comparative 
statics. As we show below, in this specification, the agency always 
chooses to monitor the firm (m > 0) but not completely (m < 1),1¢., 
the agency never chooses a corner solution. Because the agency’s 
choice of the level of monitoring is effectively unconstrained (when 
an alarm has not sounded), it is able to adjust its behavior in either 
direction in response to variation m the firm’s contnbution. 
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equilibrium. We employ the sequential equilibrium re- 
finement to ensure that players modify their beliefs in 
accordance with Bayes’ rule in response to new infor- 
mation and that each player’s behavior maximizes its 
expected utility at the time of action.” 

In solving for the separating equilibrium, we also 
identify the conditions under which it exists, i.e., con- 
ditions under which the firm’s contribution correctly 
reveals its type to the agency. Conditions for existence 
are of particular substantive interest here because in 
equilibrium, the agency infers not only the firm’s type 
from its contribution, but also its chosen level of com- 
pliance. The agency obtains a smaller per unit reward 
for documenting infractions at firms with high compli- 
ance costs, but it also finds more infractions at those 
firms. When the latter incentive dominates, the total 
reward increases in the firm’s type, and the agency will 
prefer to monitor higher-type firms more. In that case, 
those firms will not be willing to distinguish themselves 
from lower-type firms through costly contributions. The 
conditions for the existence of a separating equilib- 
rium characterize circumstances under which the for- 
mer incentive dominates the latter. When the per unit 
reward to the agency decreases rapidly enough as the 
firm’s costs of compliance increase, the separating equi- 
librium exists. We present this condition formally in 
Appendix A. 

The game consists of two subgames that can be an- 
alyzed independently: one in which the alarm has not 
sounded and one in which it has.” In each, the firm 
chooses a level of compliance, which is not directly 
observed by the agency, and a contribution, which is. 
The agency then chooses how much to monitor the 
firm. The difference between the subgames is that in 
the former, the agency can choose to monitor the firm 
as much or as little as it likes, whereas in the latter, it 
must perform at least a mandatory minimum inspec- 
tion. We characterize equilibrium behavior in each of 
these subgames and then compare them. 


Pure Patrol Subgame. We solve by backward induc- 
tion in order to ensure that each player’s choice of 
action best serves his or her own interests, from the 
perspective of his or her own beliefs, at the time at which 
he or she acts. We first characterize the agency’s choice 
of the level of monitoring, given its knowledge of the 
basic setup of the game, the firm’s equilibrium strategy 
k*(z, c) and c*(r), and its inference (in the separating 
equilibrium) of the firm’s type from its observed con- 
tribution. We then characterize the firm’s optimal con- 
tribution and level of compliance, given that the firm 
knows its own choice of each and correctly anticipates 
the agency’s response, m*(k*(-), c*(-)). 


12 More precisely, we derive the unique separating sequential equi- 
librtum behavioral profile, which, in conjunction with the appropriate 
beliefs, constitutes the unique untversally divine equilibrium. For an 
overview of signaling games in political science, see Banks 1991. 

D Making fire alarm occurrence endogenous (e.g. as a function of 
compliance) would have the following effects, neither of which would 
alter the substantive mtuition of our more parsimonious account. 
(1) Firms would have additional incentives to comply aside from in- 
creasing marginal fines, and (2) the agency’s beliefs in each subgame 
would change 
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In equilibrium, the levels of compliance and contri- 
bution are such that the value to the firm of an ad- 
ditional unit of resources committed to either is zero 
for all types of firms. However, a firm with a higher 
per unit cost of compliance chooses to comply less. To 
compensate for the lower compliance, which, ceteris 
paribus, results in higher marginal fines, the higher- 
type firm makes higher political contributions in order 
to obtain a lower level of monitoring, thereby ensuring 
that a smaller proportion of infractions is detected. 

The agency in equilibrium always chooses to monitor 
the firm somewhat, but never completely. The marginal 
cost of monitoring is increasing for the agency and is 
independent of both the firm’s behavior and its type. 
In contrast, the marginal benefit from monitoring de- 
pends on both the firm’s type and its chosen level of 
compliance, and the restrictions discussed above on the 
schedule of per unit rewards for the agency ensure that 
the marginal benefits of monitoring are decreasing in 
the firm’s type. Thus, ceteris paribus, the agency mon- 
itors higher-type firms—which are identified in equi- 
librium by their higher contributions—less than lower- 


type firms. 


Alarm Subgame. The analysis of the subgame that 
follows the sounding of the alarm differs from that 
of the other subgame only in the imposition of a po- 
tentially binding constraint on the agency’s behavior. 
Because the agency must monitor at least M^, the firm 
has no incentive to offer a contribution in excess of the 
amount necessary to induce the agency to choose that 
minimum level of monitoring. Similarly, the firm’s level 
of compliance reflects its anticipation of being moni- 
tored at least that much. Thus, not only is the agency’s 
equilibrium monitoring strategy constant at 7h for all 
contributions greater than some critical value ê, but 
also the firm’s equilibrium strategy c*(r) is constant 
at ĉ for all t > ?. Although the equilibrium level of 
compliance is decreasing in the firm’s type throughout, 
it does so more slowly for t > ? when the alarm has 
sounded than when it does not. 

Two propositions summarize our results. The first 
characterizes behavior in both subgames: 


Proposition I. In the unique separating sequential 
equilibrium behavioral profile: 


(1) the level of the agency’s monitoring of the firm, 
and hence the proportion of committed violations 
that are detected, is decreasing in the size of the 
contribution, ceteris paribus; 

(2) the size of the contribution is increasing in the per 
unit costliness of compliance for the firm, ceteris 
paribus; and 

(3) the level of the firm’s compliance is decreasing 
in the per unit costliness of compliance, ceteris 
paribus. 


Thus, in both subgames, firms that make larger con- 
tributions commit more infractions, holding firm size 
constant, yet are monitored less. The second proposi- 
tion compares behavior in the unique separating se- 
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quential equilibria of the alarm and pure-patrol sub- 
games, 


Proposition 2. For sufficiently high costs of com- 
pliance: 


(1) the firm contributes strictly less when the alarm 
sounds than when ıt does not!“ 
(2) the firm is strictly more compliant when the alarm 
sounds than when it does not; and 
(3) the agency monitors the firm strictly more when 
the alarm sounds than when it does not. 
For all other types, the alarm has no impact on the be- 
havior of the firm and the agency. 


AN APPLICATION OF THE THEORY 


Three of the six predictions above are testable. As 
firm political expenditures increase, regulatory over- 
sight should decrease. Firms with publicly observable 
problems (fire alarms) should reduce their political 
expenditures. And inspections should increase where 
fire alarms have sounded. To test these predictions, we 
collected data on the monitoring activities of the NRC 
at 63 privately operated nuclear power plants over the 
nine-year period from 1994 to 2002, and campaign con- 
tributions from their associated operating companies’ 
PACs. 


Monitoring Activity of the Nuclear 
Regulatory Commission 


Since 1974, the NRC has been charged with licensing 
the construction, operation, and decommissioning of 
civilian nuclear plants; licensing the possession, use, 
handling, and export of nuclear material; investigating 
nuclear incidents; and monitoring the activities of li- 
censed facilities (Office of the Federal Registry 2003). 
Inspection of facilities to ensure their safe operation 
constitutes the bulk of the fieldwork conducted by the 
Commission’s four regional offices. Several key fea- 
tures of the oversight program differ from those of the 
inspection programs of other federal agencies (such as 
the EPA or OSHA). First, the NRC maintains a full- 
time inspection presence at each facility in the form of 
“resident inspectors.” In addition to this presence, the 
regional offices may allocate plant-specific supplemen- 
tary monitoring resources. 


14 Another plausible way to model the effects of fire alarms is to 
assume that the reward to the agency per unit of observed violation 
increases after the alarm sounds. If political expenditures were di- 
rectly purchasing reduced oversight in the sense of a quid pro quo, 
such an increase would drive up the “purchase price” of forbearance. 
Accordingly, we would expect to see firm contributions increase after 
a fire alarm. By contrast, ın our theory a firm’s expenditures affect 
regulatory oversight only through their impact on the agency’s beliefs 
about the firm’s type. If the agency’s rewards to regulation are so high 
that regardless of its beliefs, it still has an incentive to regulate, then 
that firm will no longer find ıt worthwhile to make these expendi- 
tures. Our prediction that fire alarms will lead on average to fewer 
political expenditures is therefore sustained given this alternative 
formulaton. Effectively, our empirical analysis provides a critical 


test for distinguishing between an exchange-based account and our 
informational approach. 
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Second, much of the inspection activity of the NRC 
is financed by fees billed to operators. Importantly, 
this does not eliminate the agency’s budget constraint. 
Congress establishes programmatic budgetary author- 
ity for the NRC’s Reactor Inspection and Performance 
Assessment program ($72.7 million in fiscal year 2002, 
with 611 full-time-equivalent employees). From 1990 
to 2000, federal statute required the Commission to re- 
coup approximately 100% of its authorization through 
fees. This percentage has been reduced, albeit only very 
slightly, in the past several years. In 2002 and 2003, 
the agency billed a professional rate of $150 per hour 
for reactor inspections. This procedure will allow us 
to construct a more stringent test of the expenditures 
hypothesis below. 


Data and Method 


We have data on the inspection and operation of 74 
civilian nuclear power plants from 1994 to 2002. For 
the current analysis, we restrict attention to the 63 
plants run by private companies. Public authorities and 
cooperatives are presumed not to behave according to 
the causal mechanism outlined above. In our test of the 
expenditures hypothesis, the dependent variable is the 
number of hours spent inspecting a plant in a given 
year (source: NRC Regulatory Information Tracking 
System). There is considerable variation in the inten- 
sity of monitoring at different plants. For example, 
the NRC billed enough hours at Northeast Utilities’ 
troubled Millstone plant in Waterford, Connecticut, in 
1998 to employ nearly six inspectors full-time. At the 
opposite end of the spectrum, the NRC billed hours 
for Dominion’s North Anna plant in Louisa County, 
Virginia, for just half of one full-time equivalent. Total 
inspection activity of the Commission has also varied 
over time; especially noteworthy is a reduction that 
began in 2000, as the NRC began relying increasingly 
on ostensibly neutral measures of plant risk. 

Our theory posits an association between political 
expenditures and inspection activity. Clearly, however, 
other factors will influence the extent of agency moni- 
toring at a particular plant.» First, plants vary in their 
technical features, and operating companies in their 
structural characteristics. We employ fixed effects spe- 
cific to individual plant—operator dyads (72 total) to 
control for all time-invariant characteristics relating to 
a site under the control of a particular firm. (A plant 
that changed ownership once during the sample period 
would be represented by two fixed effects.) If a plant’s 
technology or construction makes it particularly error 


15 The necessity of control variables 1 critical for theoretical as well 
as methodological reasons. One can envision an alternative theory 
in which the regulator is fully mformed of firms’ compliance costs 
based on 1ts knowledge of their observable features. Such a model 
would predict no relationship between firm political expenditures 
and agency oversight, except to the extent that political expenditures 
made for other purposes serve as proxies for omitted variables mea- 
suring salient firm characteristics If political expenditures are found 
to influence agency activities m the presence of a well-specified set of 
controls, our signaling theory will prove to have greater explanatory 
power than this full-information account. : 
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prone, or if an operator’s corporate structure impedes 
the safe operation of the plant compared to that of 
other operators, such features’ impact will be absorbed 
by the fixed effects.1° We also gathered data on the 
operating characteristics of individual plants (source: 
Energy Information Administration). In addition to 
the plant-specific effects, we control for time-varying 
characteristics including the number of reactors on line 
at the plant (which varies over time at several sites), the 
capacity of the plant in Megawatts Electric (MwE), the 
proportion of total hours in the year in question that 
the plant was on-line, and the average operational age 
of reactors at the plant. 

Second, the NRC views plants in more populated 
areas as inherently more dangerous. This measure will 
be spanned by the fixed effects, but for specifications 
that exclude the dummies, we control for the popula- 
tion density per square kilometer in the surrounding 
county. 

Third, in the past decade, the Nuclear Energy Insti- 
tute, which lobbies on behalf of the nuclear industry, 
has defended market consolidation of nuclear utilities 
on the grounds that scale economies permit larger com- 
panies to enjoy superior safety records. We therefore 
control for the total assets of the operating company 
in thousands of dollars!” as well as the total number 
of nuclear plants operated by the company during that 
year. Controlling for company assets also insures that 
our measure of campaign expenditures is not merely 
picking up the effect of firm size. 

Fourth, any number of time-specific factors could 
influence the Commission’s decision to increase or de- 
crease inspections across all plants. For example, in 
2000, the NRC revamped its inspection process; the 
immediate effect has been a reduction in total reactor 
oversight. Also, the agency’s budget and the hourly 
rate it charges vary from year to year. Rather than 
attempting to control for all potentially relevant inter- 
ventions with cross-reactor effects, we instead employ 
year-specific dummy variables.18 

Finally, we examine the operating company’s associ- 
ated campaign expenditures (PAC contributions to and 
on behalf of candidates) for that year (source: Federal 
Election Commission). Since our causal story concerns 
political expenditures more generally, we might em- 
ploy the sum of lobbying expenditures and campaign 
expenditures as a comprehensive measure of political 
investment. Since 1995, lobbying firms and corpora- 
tions with in-house lobbyists have been required to 
disclose their expenditures to the House and Senate 
twice a year. However, the disclosure process is not 
as standardized as the FEC reporting system, and we 


16 The effects also capture vanation among regional offices; see 


Whitford 2002. 
17 All dollar values deflated to 1996 equrvalents using the GDP Im- 
pet Price Deflator. 

8 The year-specific effects also permit us to control for changes in 
the preferences of the NRC’s pohtical principals and commissioners. 
Questions of overhead political control are important (e.g, Wood 
and Waterman 1994) but beyond the scope of the current analyms 
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TABLE 1. Summary Statistics for Model Variables 


Variable 
Plant level (N = 535) 
Inspection hours at plant 
Reactors on-line 
Energy capactty at plant (MwE) 
Proporton of year reactor on-line 
Reactor type (0 = bolling water; 1 = power) 
Average operational age across reactors 
Population density In reactors county 


Regulatory violations 
Scrams 


Operator level (N = 310) 
Numer of operated plants 
Operator cash on hand (millions) 
Operator total assets (millions) 
Operator profit ratio 
Campaign expenditures (thousands) 


Expenditures to oversight committees (thousands) 


Observed corporate ch thousands 





SD Min Max. 


15.22 
1,831.90 


“Negative expenditures induced by transfers from candidates to PACs. 


encountered numerous errors in the reports.’ Because 
of these difficulties, and because of the high correlation 
between lobbying and campaign expenditures noted 
above (and confirmed in our preliminary analysis of 
nuclear operating companies), we employ campaign 
expenditures as a proxy for broader political expendi- 
tures. Table 1 presents summary statistics for the vari- 
ables employed in our analysis. 


The Potential for Endogenelty Blas 


Moving to an empirical specification necessitates rec- 
ognizing the endogeneity of expenditures (a choice 
variable). We employ an instrumental variables esti- 
mator that resembles two-stage least squares (2SLS), 
with an important exception: 2SLS uses all exogenous 
variables in a system of equations for the first stage re- 
gression. This approach is inappropriate in the current 
context, because the unit of analysis for the inspection 
equation is the plant, while the unit of analysis for 
the expenditures equation is the company. Thus, we 
employ only firm-level covariates in the first stage. 
Reliance on the instrumental variables methodo- 
logy requires identifying exogenous covariates corre- 
lated with campaign expenditures, but that only affect 
inspections indirectly through the contribution them- 
selves. The firm-level variables we employ are finan- 
cial measures derived from the operating companies’ 
10-K filings and their first differences. The logic of 
employing first differences is that certain changes in 
the financial status of a company could not be per- 
ceived directly by the NRC until the publication of 
the company’s financial statements and, thus, should 
not influence inspection decisions. These variables, of 


19 Some lobbyists report expenditures for the full year on year-end 
reports, while others report only those from the last sx months. Also, 
some firms file both as registrants and as clients; although they are 


supposed to report expenditures as clients in their registrant totals, 
not all do so. 
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course, are functions of firm choices. We justify their 
use as instruments by noting that the financial data 
typically reflect investment and other financial deci- 
sions having nothing to do with the day-to-day op- 
erations of the plants, as well as external economic 
conditions. The right-hand-side variables appearing in 
the first stage regression along with their first dif- 
ferences are the firm’s total assets (which also ap- 
pears in the inspection equation), cash on hand, and 
profit ratio (net income divided by total assets). Also 
included are firm-specific and year-specific indicator 
variables. Although for reasons of space we do not 
present the results of the first stage regression here, 
the results are similar to those displayed in Table 3 
below. An F-test suggests the joint significance of the 
excluded instruments (see Bound, Jaeger, and Baker 
1995). 


Results 


The Expenditures Hypothesis. Table 2 presents re- 
sults from eight different specifications of the full 
model. The dependent variable is the number of hours 
the NRC inspected plant i in year t. For purposes of 
comparison, we present OLS estimates along with IV 
counterparts. Although the IV estimates are to be pre- 
ferred to OLS on theoretical grounds, Hausman tests 
comparing the estimates suggest no significant differ- 
ence between the two. Thus, OLS estimates are ar- 
guably preferable for efficiency reasons. Columns (1) 
and (2) display results of the basic estimation with 
neither fixed effects nor lags, while (3) and (4) add 


2) We employ financial attributes of the parent companies, where 
applicable, and the campaign expenditures of both parent and sub- 
sidiary. 


Vol. 99, No. 2 


American Political Science Review 


‘(81964 PeTE]-OM}) 10°O > d, PUB GOO > d, ‘seeelquered uj sonSsiEre-) jue|stsU0o-AjDASBpeceOJe}ey esuada Jo enwa EINjoeqy JON 
CN PEA ee Pe TEN U EORSTB TO) JUB]SISUCO-AIPASBPOOSOIO]OY OSMANA JO EN[BA GINjOEqy JON 


oly GLY GLY GLY StS SES SEG Sts N 





— 780 — €2'0 — €8'0 — 2S'0 zH 
SO, 894 ON ON s94 Sah ON ON 8}28}Ə IBA, 
894 S94 ON ON sa, sal ON ON GOO Pexy JO}BIEdO-jUBiq 
(€8"e) (68'£) (09'9) (6S9) (09°9) (699) (v66) (98`6) 
= 18°862'Z «9h SOC'Z mOS'E16'Z ~86'626'Z ~86'00Z'E mya’ S69'E =ZE'Z6£'S «09 FBP'S }deas6}u| 
(veL) (SE'L) (90`0) (200) 
Zs'9 +99 6}`0— zgo- — — — — SUORBJOJA pebhey 
(ozz) (9¢'2) (€6'6) (6201) 
«710 „S'O »E9 0 =S 0 = — = — sunou uoppoedsu; peb6e7 
(86°c) (06°2) (LLE) (Yre) (vo'e) (€9°Z) (Lb'p) (ZZY) 
~80'F6 ~/8' 18 -BP' LS wt LLP „SLY =E£8'96 wLLLEL =98 IZL poguos sJoyaiedo JepUN s}uejd 
(08'0) (29°0) (96'0) (Ze"0) (LE'L) (6v4) (98`0) (#0'L) 
LO'O LO'O 00'0— 00°0- LO'O LO'O LO'O LO'O 816888 [BOL V 
(8p) (€z'4) (L'Z) (6z'Z) (@¢" 1) (S9"L) (96° 1) (82'2) 
Ł0'0— L0`0— .}Ł0'0— .„Ł0'0— L0'0— 0'0- Z0'0— .Z0'0— gjesse [a] 
(zes) (eg'S) (69'0) (16°0) (L6°LL) (90°L L) (99 |) (22' 1) 
~Z/'96 EC Y6 L0'Z—- LEZ- ~LO'OSL ES BFL ZS'8Ļ-— Z0'0Z— oe feucpaiedO 
(oz`4) (LZ'L) (90) (Z'0) 
— — 9'0 9'0 — — S'O GLO Ayrsuep uopejndod 
(Z'L) (2€`4) (96° 1) (LPL) 
— — S6'£4— LE'EZ— — — 69'Z94— 60'£94— edf] 1089H 
(26'S) (0'9) (99'S) (99'S) (0€°Z) (6° 2) (66'S) (OL'9) 
=OV'OOE'Z— sm EOS LE‘Z— = E 9ZE'Z- „EE SEE'Z— = cBOGS'2— ss am PH'EBG'Z— sm BSOEO'E— mh SZZ‘e— BU|FUO 1079881 % 
(€9°6) (896) (09'0) (6S°0) (p2°Z) (L02) (610) (940) 
#B6E'Z „BEZ LE'O LE'O 96" | ~S6' | Go'0 ¥0'0 (SMW) Aypedeo ABieuy 
(z8r) (L6H) (8'4) (L6'L) (22'9) (Lb'9) (68"e) (ZZ £) 
„0E 6Y8'L— sm BG PBL‘ — 96682 YL’ Ze2 =69'6SL'Z—  sOV'ZVOZ— ss mL BB „99'206 Əulļj-U0 S10}LeH 
(¢g‘2) (29°2) (66'e) (00'r) (S9°2) (zZ) (7G'9) (ve's) 
.90'E— =9EZ— m£O'C— =9EZ— abo b— L'Z- „t0 L— #GL'G— saunypuedxe ubjedweg 
Al STO Al S10 Al S10 Al S10 erqepe, Houe 
(8) (2) (9) (S) (p) (£) (z) (1) 





yoy Aole Boy DUN [EAS + ]UBIq UO senypuodxa [BIpI/Og 10}81ed6 Jo peyg CUL Z J1aYL 


253 


Flexing Muscle: Corporate Political Expenditures 


plant- and year-specific effects. Columns (5) through 
(8) replicate the analysis, adding lagged values of in- 
spection intensity and regulatory violations.” 

Before proceeding to the substantive results, we note 
that one can easily reject the null hypothesis for all 
four specifications that the plant-specific fixed effects 
are jointly equal to zero. Likewise, one can reject the 
null for the joint significance of the year-specific effects 
in each regression. Finally, introducing a dynamic ele- 
ment to the model in the form of the lagged dependent 
variable is also a statistically significant contribution. 

Consider first the measure of the number of reac- 
tors on line at a site. In the specifications with no 
fixed effects, the measure is statistically significant and 
positive at at least the 10% level, whereas in speci- 
fications with fixed effects included, the coefficient’s 
sign dramatically switches. This is because only three 
plants varied in the number of operational reactors 
over the time period examined: Maine Yankee, Zion, 
and Millstone. The former two reduced the number of 
reactors during a site decommissioning process (which 
increased inspection presence), while Millstone’s diffi- 
culties simultaneously boosted inspection presence and 
necessitated the decommissioning of one of its reactors. 
Another technical measure of note is the percentage of 
time the reactor is on-line: Both forced and unforced 
outages typically boost inspection presence, while sus- 
tained smooth operation decreases it. 

On the one hand, operators with large total assets 
are inspected less, although the statistical significance 
of total assets varies across specifications. However, 
controlling for total assets, the more plants an operator 
runs, the more those plants are inspected. On the one 
hand, this could imply that the Commission suspects 
a company’s resources are spread thin. More likely, it 
could result from the fact that in the late 1990s through 
2002, a few big firms such as Exelon and Entergy began 
buying up troubled plants. 

Finally, we turn to the campaign contribution vari- 
able. In all eight specifications, the coefficient on the 
measure is negative and highly statistically significant, 
in line with the prediction of the theoretical model. 
Depending on specification, a $1,000 increase in expen- 
diture produces an estimated drop in inspection hours 
anywhere from slightly more than two hours to more 
than seven. An increase from the 25th percentile of 
contributions to the 75th percentile (about $13,800 to 
$76,200) would produce anywhere from a 147 (+110)- 
to a 439 (+134)-hour reduction in plant inspections 
This corresponds to a reduction of from 5% to 16% 
from the median inspection intensity of 2,800 hours” 


21 The NRC and its regional offices do not commit to a fired monitor- 
ing allocation across plants each year, mstead adjusting the allocation 
as necessary Thus, even though our theoretical model suggests that 
campaign contributions precede monitoring, it 1s not desirable to 
employ a one-year lag on campaign expenditures in the empirical 
specification Modeling continuous adjustment by the agency 18 a 
task we leave for future research. 

2 The finding of a negative relationship persists in mmpler speci- 
fications, for example, the pairwise correlation between mspection 
intensity and contnbutions 1s a statistically significant —0.35. 
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A More Stringent Test. We can adopta more stringent 
criterion for our analysis: The magnitude of the coef- 
ficient on campaign contributions cannot be too big. 
Our theoretical model demonstrates the existence of 
a separating equilibrium in which firms with different 
private compliance costs contribute different amounts. 
In the case of nuclear regulation, there are additional, 
easily observed costs to inspection: the fees that the 
NRC charges firms for oversight. 

Now consider the observable savings to the average 
firm associated with $1,000 of political expenditures. In 
adjusted 1996 dollars, the average inspection fee listed 
in the Code of Federal Regulations from 1994 to 2002 
was $129.56 per hour. The average firm in the sample 
operated 1.71 plants in any given year, implying that 
a hypothetical inspection reduction of one hour across 
all plants would result in an average net savings of 
$221.55 per operator. For this hypothetical firm to 
break even on a $1,000 investment in terms of observ- 
able direct costs alone, the marginal reduction in annual 
inspections would therefore have to be 4.51 hours per 
plant. 

One might initially conclude that it would be irra- 
tional for our hypothetical average firm to contribute if 
it got bang for the buck less than 4.51 hours. In fact, our 
theory’s causal mechanism suggests just the opposite: 
If the marginal reduction were greater than 4.51 hours, 
the separating equilibrium would collapse: Any indi- 
vidual firm, irrespective of its private costs, would have 
a best response of increasing its expenditures, driving 
inspections all the way down to zero. In such a situa- 
tion, however, campaign contributions would reveal no 
private information to the regulator, so the regulator 
would fail to update and inspect naively. Thus, 4.51 
hours represents the absolute maximum reduction that 
could in principle sustain the equilibrium.” 

Figure 1 displays the distribution of the estimated 
B’s on the expenditure variable across our eight model 
specifications. The vertical lines in each panel are po- 
sitioned at —4.51 and 0, respectively. Regions under 
the density shaded gray denote the probability that the 
parameter value falls outside the critical region. ‘The 
number in the upper-right corner of each panel de- 
notes the percentage probability that the parameter 
falls within the region. 

In five of the eight specifications, the parameter falls 
within the specified range with 87.8% or greater proba- 
bility, and in one with approximately 56% probability. 
In the remaining two, the magnitude of the estimate 
is too large. Note that these two specifications place 
constraints on the model that statistical tests suggested 
were unwarranted: The large coefficient values in the 
models with neither lags nor fixed effects most likely 
reflect the fact that the contribution variable is picking 
up variance attributable to other company or plant 


B In practice, the figure may be quite a bit smaller than 4.51, because 
the NRC might be able to observe additional direct costs that the 
analyst cannot. Also, this assumes that the sole purpose for campaign 
expenditures by nuclear operators is to signal the NRC, a far stronger 
claim than any we wish to make here. 
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FIGURE 1. The Magnitude of the Expenditure Effect Across Specifications 
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characteristics or secular changes in contribution be- 
havior and inspections over time. 

Readers might object that this test is problematic 
because campaign contributions play the role of proxy 
for a broader set of political expenditures. However, 
if the effects of other political spending are picked 
up by the contribution variable, we would anticipate 
that the magnitude of the coefficient on the unobserv- 
able total expenditures variable would be smaller than 
what we observe, lending additional support to the 
test. 


The Fire Alarm Hypotheses. Our theory also implies 
that when an operating company’s plants produce vi- 
olations that are observable to actors other than the 
NRC and the operator, compulsory oversight reduces 
the incentive of firms to invest politically. Thus, we 
ought to observe smaller contributions in observably 
troubled firms. Both Northeast Utilities (the operator 
of Millstone) and FirstEnergy (the operator of Davis- 
Besse) reduced their campaign expenditures in the 
year following embarrassing revelations at their plants. 
Unfortunately, these reductions occurred during off- 
election years. Also, the revelations come from NRC 
inspections. Testing the fire alarms hypothesis statisti- 
cally requires a more general fire alarm measure that 
satisfies two criteria. First, it should be minimally sub- 
ject to opportunistic manipulation by plant operators 
and not rely on determinations made by regulators. 
Second, it should be publicly observable. 

The NRC collects a number of indicators to assess 
plant safety performance. One measure that comes 
closest to meeting the above criteria is the number of 
unplanned, automatic shutdowns, called “scrams” in 
industry jargon, at a reactor in a given year. During a 
scram, the neutron-absorbing control rods plunge into 
the reactor pressure vessel, abruptly ceasing nuclear fis- 
sion. Scrams may occur for a host of reasons, including 
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safety system failures, water volume or pressure loss, 
fires, and blown fuses. After a scram in 2003 at the Nine 
Mile Point in upstate New York, an NRC spokesman 
noted, “It’s like running your car at 60 miles an hour 
down the highway and then slamming on the brakes. 
It’s not very good, but nonetheless it’s designed to do 
it” (Partlow 2003). Scrams are frequently reported by 
local media outlets. 


TABLE 3. The Effect of Fire Alarms on 
Operator Political Expenditures: OLS 
Estimates 

lanatory Varlable 
scrams at operator's plants 


Plants under operator's control 
Cash on hand 

ACash on hand 

Tota! assets 

A Total assets 


Profit ratio 


(1.16) 
76.29 


(1.93) 
—156.63 
(1.90) 
59.53" 


(3.08) 

Operator fixed effects No 

Year effects Yes 

R? 0.57 

N 310 
Note. Absolute value of groupwise heteroecedasteity- 
consistent r-statistics in parentheses. *p < 0 05 and “p < 0.01 
(two-tailed tests). 


AProfit ratio 


Intercept 


255 


Flexing Muscle: Corporate Political Expenditures 


May 2005 


TABLE 4. The Effect of Scrams on Plant-Level NRC Regulatory Activity 


(1) 


Explanatory Variable OLS 


Campalgn expenditures 
Scrams 

Reactors on-line 
Energy capacity (MwE) 
% reactor on-line 
Reactor type 

Population density 
Operational age 

Tota] assets 

ATotal assets 


Plants under operators contro} 


Lagged Inspection hours 


Lagged violations 


Intercept 


Plant-operator fixed effects 
Year effects 

R? 

N 


Note: Absolute value of groupwise heteroscedasticity-conslstent tstatstcs In parentheses. *p < 0 05 and 


=p < 0.01 (two-tailed tests) 


To assess the influence of scrams on company po- 
litical expenditures, we reestimated the expenditure 
equation, this time including the number of scrams 
at plants belonging to the operator (scram data are 
from the Idaho National Engineering and Environ- 
mental Laboratory). Because contributions occur at 
the operator level, the level of analysis is the operator- 
year rather than the plant-year. Results are displayed 
in Table 3. In specification (1), the scram measure is 
negative and statistically significant. All else equal, an 
additional scram leads to a $3,630 reduction in political 
expenditures that year. In the specifications including 
fixed effects, the coefficient estimate remains negative 
but is no longer statistically significant at conventional 
levels; however, this is very likely a consequence of 
minimal variation in scrams from year to year (Beck 
and Katz 2001). 

Next, in Table 4, we reestimate several specifica- 
tions of the inspections regression including scrams as 
an independent variable. We expect to see increased 
oversight at plants where fire alarms have sounded. 
Contrary to expectation, in none of the specifica- 
tions we employed was the scram variable statisti- 
cally significant. Puzzled, we contacted the NRC and 
were informed that, indeed, following a scram, agency 
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rules require that a supplementary inspection, called 
a “95001,” be performed. What, then, explains the 
null finding? One plausible explanation that is con- 
sistent with the result is that fire alarms have two 
effects, only one of which is adequately captured by 
the model. The first is a short-term increase in in- 
spections. The second is compulsory investment by 
the operator in compliance technology that would 
lead to a long-term decrease in oversight at the plant. 
An examination of more fine-grained inspection data 
(not available at present) would allow us to distin- 
guish the short- and long-term effects more com- 
pletely. 


The Nature and Specificity of the Signal 


In presenting our theoretical story, we emphasized that 
it is political expenditures, rather than voluntary ex- 
penditures more generally, that more effectively com- 
municate a company’s willingness to dispute agency 
decisions. It is possible to construct an empirical test 
of this distinction. If any “burning money” will suffice 
as a signal, we ought to observe an agency response 
to nonpolitical voluntary corporate expenditures. We 
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TABLE 5. Examining the Spectfictty of the Signal: The Value of Corporate Charity 

and Contributions to Oversight Committee Members (Instrumental Variables 

Estimates) 

Explanatory Varable (1)4 

—7.18" 
(5.97) 


Total campaign expenditures 
Contributions to oversight members 
Nonoversight contnbutions 
Corporate charity 

Reactors on site 

Energy capacity (MwE) 

% reactor orrline 

Reactor type 

Population denstty 

Operational age 

Total assets 

ATotal assets 

Plants under operator's control 
Lagged Inspection hours 
Lagged violations 


Intercept 


Plant-operator fixed effects 

Year effects 

N 

F-test 1: Baer — Bnon-over = 0 

F-test 2: Baw + Baon_ae = 0 — 


__Frtest 2: Bower + Broom =O AOR 
Note Absolute value of groupwise heteroscedastcity-conslstent r-statistics In parentheses. *p < 0.05 and "p < 


0.01 (two-talled tests). 


a Estimates derved via multiple Imputaton for left-censored values of corporate charty (50 imputatons). 


therefore gathered data on the corporate philanthropy 
programs and foundations associated with each operat- 
ing company in our sample (source: National Directory 
of Corporate Giving, various editions). 

There is a slight difficulty with the data: Compa- 
nies sometimes report having a program and/or foun- 
dation but only report the disbursements of one (or 
neither). Thus, the data are sometimes left-censored. 
We therefore employ a multiple imputation technique 
that supplements an algorithm suggested by Tanner 
(1993, 42).** Coefficient estimates for the model in- 
cluding corporate charity appear in columns (1) and 


24 After estrmating an initial vector of parameters via Tanner’s EM 
algorithm in the first stage of our IV regression, we generate 50 
imputed data sets by assuming that the missing data are draws from 
a truncated normal distribution. (Using the truncated rather than the 
untruncated normal allows us to exploit our knowledge that censored 
contributions must be at least a certain amount.) We then estimate 50 





(2) in Table 5. Philanthropy has no statistically signif- 
icant effect on the NRC’s inspection activity, lending 
credence to our distinction. 

One must also be careful in specifying precisely 
which political expenditures are most relevant. As dis- 
cussed above, corporate PAC spending is a proxy for 
a broader set of political expenditures. However, to 
the extent that operating companies wish to tap into 
legislators’ ability to manipulate agency incentives, we 
might expect that the agency will respond more when 
they make contributions to legislators whose actions 
are more relevant to the agency: in the case of the 


coefficient vectors and present the mean of those estimates Standard 
errors are denved using the formula derived m Rubm 1987 and 
discussed ın King et al. 2001. Code 1s available upon request. Treatmg 
the charity data as fully uncensored produces substantively identical 
results to those presented here. 
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NRC, those sitting on the House Energy and Com- 
merce Committee and the Senate Committee on Envi- 
ronment and Public Works. On the other hand, because 
operating companies are regulated by many different 
agencies overseen by different committees, it could 
be to the benefit of those companies to reduce the 
specificity of the signal somewhat by investing across 
members of different committees. To examine these 
possibilities, we divided corporate contributions into 
two components: those to members of the relevant 
oversight committees and all others. Estimates appear 
in columns (3) and (4) in Table 5. 

Before examining the results, we note that oversight 
and nonoversight contributions are very highly corre- 
lated (p = 0.86). In the specification with fixed effects, 
this multicollinearity swamps the estimates, eliminat- 
ing the significance of both measures (their sum in 
this specification would still be significant at the 10% 
level applying a one-tailed test). The specification with- 
out fixed effects or lags in column (3), however, does 
provide evidence for the argument that there is more 
signal for the buck in contributions to the committee 
members. An F test of the difference between the co- 
efficients on the oversight and nonoversight measures 
rejects the null at above the 99.9% level. We are some- 
what reluctant to make too much of this finding, how- 
ever. It may be the case that oversight contributions 
are a better proxy for aggregate political expenditures 
than nonoversight contributions, which would help to 
explain why the magnitude of the former’s coefficient 
is so large. At this point, we wish only to point out that 
the distinction between these different types of expen- 
ditures is intriguing, and worthy of greater attention in 
future research. 


CONCLUSION 


In this study, we have argued that political expenditures 
can play a role heretofore unexamined by political 
scientists: that of signals of the willingness to pursue 
one’s policy goals in the political arena if necessary. 
Critically, the effectiveness of the mechanism might 
obviate the need to actually engage in open political 
battles directly. In this application, the sender of this 
information is the corporation, and the potential adver- 
sary receiving the information is the regulatory agency. 
We find evidence that even controlling for a host of 
other factors, the NRC reduces its inspection at nu- 
clear plants whose operators make large contributions 
to political campaigns. We find further evidence that 
operators tend to reduce their political expenditures 
when mishaps at their plants necessitate | 
inspections by the Commission. 

Not all expenditures send the same kind of signal. 
Because elected officials ultimately control both the 
resources available to the agency and the rewards to 
the agency of doing its job, political expenditures tap 
into the incentives of the agency in a way that other 
expenditures would not. This distinction suggests two 
future lines of inquiry. First, additional work is required 
to parse out the effects of different kinds of political 
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expenditures. Second, because legislators play a critical 
role in determining agency incentives, it is desirable 
to incorporate them more explicitly as strategic ac- 
tors in a comprehensive model. Legislators might cali- 
brate the budget and statutory authority of an agency 
partly to maximize campaign contributions. Further, 
because the benefits to firms would accrue at the stage 
of bureaucratic implementation, legislators could es- 
cape blame for outcomes that hurt the welfare of con- 
sumers. 

Although we consider the relationship between firms 
and regulators, the underlying logic of our causal mech- 
anism extends readily to interest group conflict more 
generally. There is no reason to suspect that an inter- 
est group, upon observing costly political expenditures 
by its adversary, would not be in a position to make 
the same inferences about the latter’s willingness to 
fight that a legislator or bureaucrat could. In so do- 
ing, the recipient of the signal would be in a position 
to reassess the viability of its position with respect 
to a particular policy conflict. Alternatively, expendi- 
tures by industry may draw the attention of opposi- 
tion groups and bolster their demands for increased 
regulation. 

Finally, we have also endeavored to articulate and 
test an explicitly political theory of regulation that does 
not rely on notions of bureaucratic capture or side- 
payments (in the form of either future employment 
prospects or bribes) by industry to the agency. In our 
model, regulation provides no direct benefit to the firm: 
All else equal, the firm would prefer that the agency 
disappear. Further, regulators are motivated, possibly 
in part by professional norms, to catch violators. In this 
sense, Our causal story is fully consistent with Wilson’s 
(1989) observation that professional civil servants of- 
ten do not desire future private sector employment, 
and that even if they do, “the kind of work that will 
impress a potential private employer is not necessarily 
that which favors industry but that which conveys evi- 
dence of talent and energy” (86). This is not to suggest 
that agencies do not differ in the nature of the rewards 
available to them. A more extensive comparison across 
agencies, or of a single agency over a more exten- 
sive time period, would foster a greater understand- 
ing of the range of conditions under which the theory 
operates. 


APPENDIX A. FORMAL DERIVATIONS 


Pure Patrol Subgame 

A sequential equilibrium of the game is a behavioral profile 
c*(r), k*(t), m*(c), and a system of beliefs p(t, c) for all t, c, 
such that 


c*(t) c argmaxu*(k*(r), m* (c), T), 
k(x) € arg maxu” (c*(t), m*(c), 1), 
m’(c) € argmaxu“(c, c*(r), K (1)), 


and p (r |c) = [Pr( (1) = 9p (D/U p(t) Pr(c* (t) =c) dt) for 
all c such that f® p(t) Pr(c*(t) = c) dt > 0. We refine the set 
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of sequential equilibria using the concept of universal divin- 
ity (Banks and Sobel 1987), which restricts posterior behefs 
off the equilibrium path. It requires that, after observing a 
contribution that is off the equilibrium path for all types, 
the agency’s posterior beliefs must assign positive probabil- 
ity only to the type(s) for whom that contribution is most 
desirable. 

Let f (k) = y+ pk with ọ <0 and y > —g, and a(m) = 
a+ pm, with a > 0, B > 0. 

Consider first the pure patrol (no alarm) subgame. In a sep- 
arating equilibrium, c(t) is one-to-one, and hence its inverse, 
T(c), exists. In accordance with Bayes’ rule, 


if At) =c, 


1 
pid= l; E ee (A1) 


It follows that 
E[u“(m, k, t)| P(t|c)] = b- (a + Bm)mm 
+ mor(T(c))(1 — k (T(c))). 
The agency chooses its level of monitoring activity m € 
[0, 1] to maximize its expected utility. From the first-order 
condition, 


aF[u*(-)] 


eT a+2fm- AROA — RAKe))) =0, 


=> m{c,c*(t), k(t) 
1 
a7 ESE (T\c))) — a]. 
The firm’s optimal level of compliance, k, is a function of 


its expectations of the agency’s strategy, m*(c), and c. The 
first-order condition is 


(A2) 


WO = a(l — Boot (y+ phm (Oo 
~ axl —mi"()) =0, (A3) 
= kc, m*(c), T) -5 (=e +g- y) . (A4) 


Substituting (A4) into (A2), we obtain the firm’s moni- 
torıng choice as a function of contribution. Collecting terms, 
mic, c*(t)) must satisfy 


A pyr’ + [Zap — r(T(c))(p + Tle) + y)] m+ Tle)r(Tic)) =0. 
(A5) 


Because o < 0 and t > 0 (i.e., compliance 1s never free), m 
is bounded away from zero. If m=1, the left-hand side of 
this equation 1s 4p + (2ap — (T(c))(¢ + y). Recalling that 
g +y > 0, this expression must be less than zero at m=1. 
Thus m ıs also bounded away from one. 

To obtain the firm’s optimal contribution c, we solve the 
first-order condition for u” (.): 


au” () _ a 
a = —1— «(1 — Ay + okt t)z =0. 


By assumption, y+ gk>0,t> 0, and k<1. Thus, either 
&n/ðc > 0 and c* =0 for all t and all k or aen/dc < 0. The 
former would correspond to a poolmg equilibrium; thus 
drm /ac < 0 is a necessary condition for the existence of a 
separating equilibrium. 

To determine when this necessary condition 1s met, we first 
obtain arn*(c)/dc. Because m* is always an interior point in 


(A6) 
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(0, 1), it is responsive to changes in c for all positive contri- 
butions. From the implicit function theorem and (A5), 


am (c) __ (T(c))(1 =m) + (r/C) — m) - (p + y)m] 
ac 8ppm + 2ap — r(T(c))(p+ y+ Tle) 
oT 
* ac 
for all c (i.e., for all t). The denominator is negative (because 


p < 0). To see that d7/dc > 0 if dn*(c)/dc < 0, observe that, 
from (A3), 


(A7) 


— = —aX{1 — m"(c)) < 0, 
and from (A6), 
Pur (-) am 


It follows that for higher types t, the marginal value of con- 
tribution c is higher and that of compliance kis lower. Thus 
dc/dt > 0 and its inverse d7/dc > 0, and 3k/ðr < 0. 

From (A7), dert*(c)/dc < 0 if and only uf 


ATO- m) + ŽITO- m) — (p + y)m] <0. (A8) 


This ınequality constitutes a constraint on the agency’s per 
unit rewards from documenting infractions as a function of 
the firm’s type. When r(r) is such that inequality (A8) is 
satisfied, then drn*(c)/dc < 0. For example, if p + y=0, then 
(A8) is equivalent to (@r/dt) < [r({t)/t] < 0. Because only the 
separating equilibrium is consistent with the posited causal 
mechanism and the empirical results, we focus exclusively on 
the case in which (A8) is satisfied. 

The equilibrium schedule of contributions c*(r) is deter- 
mined by (A6), evaluated at k*(c,t) and m*(c), which is 
equivalent to the following differential equaton: 


~ = a1 = k (c, Dy + pk (c, 1) + 1) 


ov 
, AMe)) 1 ~ m) + dr/de{ Tle) — m) - p + ym] 
r(T{c))(e + y + T(c)) — 8Bpm — 2ap 
(A9) 


The satisfaction of (A6) for all t, and hence the satisfaction 
of (A9), 18 necessary and sufficient for the existence of a 
separating equilibrium (Mailath 1987). Sequential rationality 
dictates that t; choose c=0 im any separating equilibrium, 
since if it were to choose a strictly positive contribution, 
it would nonetheless be monitored the same amount. This 
condition identifies a unique solution to (A9) and, hence, 
the unique separating sequential equilibrium behavior. Since 
only the separating equilibrium behavior can be supported 
by beliefs that satisfy universal divinity, the strategy profile 
identified by (A9), (A4), and (A5) is the unique universally 
divine behavior. 

All contributions c < c(t) occur on the equilibrium path 
of play. For all higher contributions c > c(t), Ta suffers less 
disutility from c than does any other type. Hence, universal 
divinity requires that 


if T< Th, 


paiese $ i 


The agency will then choose a level of monitoring 
m= m" (c*(Ta)). Although this corresponds to the lowest level 
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of monitoring chosen in equilibrium, the firm is better off 
choosing c* (Ta), since it would then pay a lower contribution 
to achieve the same low level of monitoring. We have already 
established that a firm of type t Æ Ta is better off choosing 
c*(r) than c*(t,). 


The Alarm Subgame 


The constraint m > implies that the highest types may pool 
at some é in equilibrium. The posterior beliefs are 


1 if  ce(t)=¢c<ê, 
p(tlc)= Ey if c(t)=c=2, (A10) 
0 f 8 c(t)#c 


Note that ıf m and thus ĉ are not binding, then (A10) and 
(A1) are equivalent. Then 


E[u“ (m, k, t)| P(t|c)] 
b — (a + pm)mo + mær(T(c)) 


x (1 — K (TCc))) ce[0, 2) 
= jo- (a + Bm)\mw + more 
x f POAC — k (t)) dt if c>é, 
and i 
m* (c, k (t)) 
A[r(T(c))(—K(T))) — a] E aia 
BE E: frre -E(t -a if si 
(A11) 


As in the pure patrol subgame, &*(t, m*(c)) is given by 
(A4). Substituting into (A11), we obtain the following de- 
finition of m* (c): 

4 pom’ + [2ap — r(T(c))(p + Tle) + v)]m 

+ Tie)r(TXc)) =0 if 


4 Ber? + aap - a) f POOE +y +1) a| m 


ce[0, 2), 


c>ĉ. 


+ FO POD 0. if 


Because the agency is constrained to choose m > M, m*(c) = 
M for c > ĉ. Because M is exogeneously determined, the last 
equation determines ?, which was defined to be the lowest 
type that chooses contribution ĉ. 

For contributions between zero and 2, the agency’s re- 
sponse is identical to its response in the pure-patrol subgame. 
Likewise, c*(t) is the same in both subgames for all t < % 
but in the alarm subgame, c*(r) =ĉ for all tr > ?. As demon- 
strated in the analysis of the pure patrol subgame above, 
the level of monitoring m*(c) is decreasing in contributions 
c < é@, contributions c*(r) are increasing in the firm’s cost 
of compliance rt E (te, 7), and the firm’s level of compliance 
k" (t) ıs decreasing in its cost of compliance t € (t, ?). 


APPENDIX B. FULL LIST OF DATA SOURCES 


Inspection hours and violations per year: Nuclear Regulatory 
Commission, Regulatory Information Tracking System. 
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Plant operations: International Atomic Energy Agency, 
Power Reactor Information System. 

Scrams per year: Idaho National Engineering and Environ- 
mental Laboratory. 

Plant operating companies, by year: Nuclear Energy Institute. 

Operating company campaign expenditures: Federal Elec- 
tion Commission. 

Static plant characteristics: Department of Energy, Energy 
Information Administration. 

Operating company financial data: Thompson’s Research/ 
SEC Disclosure; Mergent online; SEC Edgar database; 

individual company homepages. 

Corporate philanthropy data: The Foundation Center, 
National Directory of Corporate Giving, various editions 
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Policy and the Dynamics of Political Competition 


MICHAEL LAVER New York University 


is paper proposes a model that takes the dynamic agent-based analysis of policy-driven party 
competition into a multiparty environment. In this, voters continually review party support and 
switch parties to increase their expectations; parties continually readapt policy positions to the 
shifting affiliations of voters. Different algorithms for party adaptation are explored, including “Aggre- 
gator” (adapt party policy to the ideal policy positions of party supporters), Hunter (repeat policy moves 
that were rewarded; otherwise make random moves), Predator (move party policy toward the policy 
position of the largest party), and “Sticker” (never change party policy). Strong trends in the behavior 
of parties using different methods of adaptation are explored. The model is then applied in a series of 
experiments to the dynamics of a real party system, described in a published opinion poll time series. 
This paper reports first steps toward endogenizing key features of the process, including the birth and 
death of parties, internal party decision rules, and voter ideal points. 


journalists, pundits, novelists, country specialists, 

or indeed practicing politicians—usually describe 
a system in perpetual motion. Perpetual motion is seen 
as a normal state of affairs not a manifestation of 
chaotic instability. And it is usually seen as having an 
endogenous dynamic; what the actors do at cycle c of 
the political process feeds back to affect the entire pro- 
cess at cycle c + 1. These informed discussions thus see 
politics as a complex dynamic system evolving under 
its own steam, a system unlikely to reach steady state. 

In stark contrast, mainstream models of political 
competition are usually static, with key model param- 
eters and rules of interaction fixed exogenously. Al- 
though static models do not necessarily imply equilib- 
rium, almost invariably the core intellectual approach 
is to specify a model and solve for equilibrium. Authors 
of such models are typically driven by what Cederman 
describes as a “metaphysical conviction that equilib- 
ria are always ‘out there’ waiting to be discovered,” a 
conviction that can “blind the analyst to the possibility 
of adjustments never settling” (Cederman, 1997, 34- 
35). But if common sense tells us party competition is 
a complex dynamic process, then it also tells us party 
competition may never achieve equilibrium. 

This paper addresses the striking disjuncture be- 
tween the static nature of most formal models of party 
competition and the dynamic way in which informed 
observers, who hardly ever talk about politics in terms 
of equilibrium, think about the political process. While 
many theorists would willingly concede in general 
terms that real political competition is a complex dy- 
namic process, the hard question concerns how to 
model this in a way that is both theoretically tractable 
and substantively plausible. In what follows, I set out 
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to answer this question by specifying a complex dy- 
namic model of political competition, by interrogating 
the model using techniques of agent-based modeling, 
and by calibrating the model to a real world example 
of dynamic political competition. To the extent that 
this undertaking is successful, it paves the way for 
an approach to modeling the complexities of political 
competition in multiparty systems that more closely 
matches the intuitions of informed observers of real 
politics. 

To get a substantive feel for the type of system I will 
be exploring, consider Figure 1, which plots quarterly 
opinion poll support for the main Irish parties between 
1986 and 1997.1 This picture will be familiar both to 
Irish politicians and to informed commentators on Irish 
politics; similar pictures can be drawn for any other 
competitive party system and convey huge amounts 
of information about party politics For example, 
Figure 1 shows us that Fianna Fáil (FF) was always the 
largest party. Indeed it was so far above the second- 
largest party, Fine Gael (FG), that even considerable 
fluctuations in its support” left FF comfortably at the 
top of the system, continuously flirting with the 50% 
vote share that would give it a legislative majority.” 
Figure 1 shows FG entrenched as the second-largest 
party—never challenging FF but challenged itself only 
once, in late 1992, by spiking support for the Labour 
Party (Lab). We see this spike in Labour support—won 
at the expense of FF and immediately followed by 
the first ever government coalition between FF and 
Labour—set in the context of a gentle secular growth 
in Labour support. We see a decline in support for the 
Progressive Democrats (PDs), while Democratic Left 
(WP/DL) bumped along on the bottom of the system. 
We can thus describe Irish party competition in terms 
of the relative sizes of the parties at any point in time, 
of fluctuations and trends over time in the sizes of each 


1 This was after the formation of the Progressive Democrats (PDs) 
and before the merger of Labour (Lab) and Democratic Left 
(WP/DL), dunng which Ireland was effectively a flve-party system. 
Figure 1 and results that follow report party shares of support for the 
five main parties 

2 From just over 40% to just under 60%. 

3 Under Ireland’s more or less proportional STV electoral system. 
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party, and of which party loses support when some 
other party gains it. 

Most specialists in Irish politics would be delighted to 
talk us through the politics of the complex interactions 
that generated Figure 1. Specialists in the politics of 
other countries, likewise, make their livings analyzing 
their own country-specific versions of Figure 1. But 
most models of party competition, confronted with 
the type of moving system reported in Figure 1 and 
forming the bread and butter of any country specialist, 
could do no better than describe it m terms of random 
noise superimposed on a steady state. ‘This large lacuna 
motivates what follows, in which I present a model that 
describes the complex dynamics of a multiparty system 
in continuous motion. My core intellectual mission is 
to charactenze the movement of this system over time, 
not to solve for some equilibrium state there is no 
a priori reason to expect. 


EXISTING DYNAMIC MODELS OF 
POLICY-BASED PARTY COMPETITION 


The main departure from traditional static theories of 
party competition can be found in an emerging liter- 
ature that models two-party competition between in- 
cumbent and challenger as an evolving complex sys- 
tem. This followed work by Kollman, Muller, and Page 
(1992, 1998)—hereafter KMP—and assumes that a re- 
alistic account of policy-driven party competition de- 
scribes a complex and evolving decision-making envi- 
ronment where key actors have very incomplete and 
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imperfect information. KMP use a paradigm that in- 
trinsically reflects this low-information dynamic envi- 
ronment, rather than tinkering with a traditional static 
spatial model based on the assumption of hyperra- 
tional decision-making with high levels of information. 
This leads them to “artificial adaptive agent,” or agent- 
based, models. The key distinction between paradigms 
is that hyperrational agents make choices by looking 
forward strategically, continuously solving and resolv- 
ing in real time the dense systems of equations in 
the high-end rational choice literature. Note that even 
these equations describe a spectacular reduction in the 
complexity of any real decision-making environment. 
In contrast, adaptive agents look backward and learn 
from the past, developing simple rules of thumb that 
condition future behavior on the recent history of the 
tem. 

We would all love to enjoy the best of both 
paradigms, but only agent-based models currently offer 
the practical possibility of modeling the endogenously 
evolving dynamic systems described by most informed 
observers of real politics. This is because agent-based 
models of party competition: work well with levels of 
information and rationality that are not “unrealisti- 
cally” high; describe parallel recursive interactions be- 
tween large numbers of agents, interactions that gener- 
ate a complex system; allow different agents to deploy 
different behaviors that can tractably be pitted against 
each other within the model; treat agents as discrete 
decision-makers with discrete ideal points; allow the 
effects of model parameters to be explored systemat- 
ically and rigorously. To be sure, agent-based models 
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are simulations that do not allow us to prove theorems 
in the style of classical game theory. But, in a context 
where the types of theorem classical game theorists can 
currently prove are at odds with the common-sense 
view of politics as a moving system unlikely to reach 
steady state,* agent-based models do allow us to get a 
feel for how gurl systems operate in a range of plausible 
circumstances.’ 

KMP (1992, 1998) presented an agent-based dy- 
namic model of two-party competition in a multidi- 
mensional issue space, setting the baseline for subse- 
quent work and citations in the field, almost all of which 
retain a U.S.-oriented focus on two-party incumbent- 
challenger systems (see, for example, de Marchi 1999, 
2003). KMP’s underlying spatial characterization of 
policy preferences is the same as that of the traditional 
spatial model, although their computational implemen- 
tation assumes that agents adopt one of a small number 
of possible positions on a finite set of issue dimensions 
and, thus, uses a discrete policy lattice rather than a real 
policy space. 6 They follow the traditional spatial model 
in assuming that voters are both policy-motivated and 
well informed about the policies of political parties, 
supporting the party that maximizes their utility. How- 
ever, KMP treat voters as discrete agents; their result- 
ing description of voter preferences as a finite scatter 
of ideal points differs radically from traditional spatial 
models, which typically use smooth density maps im- 
plying an infinite number of voters. This more realistic 
description of voter ideal points requires a fundamen- 
tal change in analytical approach; the calculus used by 
traditional spatial theorists to analyze smooth density 
maps is replaced by the numerical analysis required to 
analyze finite sets of discrete points. 

KMP depart from the traditional model in assuming 
party leaders are not perfectly informed—either about 
the utility functions of every single voter or about the 
probability functions associated with these. Instead, 
they gather information from limited (and private) 
opinion poll or focus group feedback during election 
campaigns, using this to adapt policy positions incre- 
mentally from a given starting point.’ KMP’s original 
model (KMP 1992, 1998) designates one of two parties 
as incumbent, constrained to stick with its current pol- 
icy position. The other is designated challenger and set 


4 Exponents of nonclassical “evolutionary” game theory may even- 
tually prove important theorems about policy-driven party compe- 
tition in a dynamic system. But a recent sophisticated presentation 
of the cutting edge of this field (Cressman 2003) implies that there is 
still a long way to go in this regard. 

> Agent-based models of politics have recently been used to explore, 
miter alia, evolution of cooperation among selfish actors (Axelrod 
1997; Bendor, Diermeser, and Ting 2003; Skyrms 1996), evolution of 
the mternational state system (Cederman 1997), migration between 
policy juriscichons (Kollman, Miller and Page, 2003), rotating pres- 
idency of the European Council (Kollman 2003), and emergence of 
civic traditions in modern Italy (Bhavnani 2003). 

6 The orginal KMP model has 15 dimensions, with seven possible 

tions on each. 

KMP do not explicitly clam that this feedback 1s private informa- 
tion, but this is implicit in the process they describe; parties test a 
battery of counterfactual policy modifications on a sample of voters 
and adopt only modifications that increase voter support. This is hard 
to envisage on the basis of public opinion polling 
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the task of searching for information on voter prefer- 
ences, using this to adapt its policy position to increase 
support. Three search algorithms for parties are investi- 
gated by KMP? but the most important feature of their 
results is that, regardless of search strategy (KMP 1992) 
and spatial distribution of voter preferences (KMP 
1998), the two party platforms systematically converge 
over a series of elections to positions that are centrist 
yet distinct. More “rugged,” less smooth, profiles of 
voter preferences slow down party convergence on 
the center but do not change the strong tendency for 
this to happen. de Marchi (1999) modified this model 
by allowing incumbent as well as challenger to adapt 
policy positions, assuming incumbents to have greater 
resources and, again m a U.S. context, that incumbents 
need not expend resources on primary campaigns. As- 
suming that voters have fixed budgets for information 
gathering and deploy these on issues in which they have 
more interest, the key finding is that, the more rugged 
the electoral landscape, the greater benefit to the in- 
cumbent, who, by assumption, has more resources to 
deploy in the search for votes? 

KMP (2003) extended their two-party incumbent- 
challenger model to multiparty competition, in work 
that raises questions about taking their search algo- 
rithms out of a two-party context. In an incumbent- 
challenger setting, testing counterfactual challenger 
positions against a fixed incumbent seems intuitively 
plausible. ‘The multiparty KMP model, however, gives 
each party an exclusive “turn” of eight iterations of 
this process, holding the positions of all other parties 
fixed, before the turn passes to another party (196). 
The entire set of party “turns” is repeated five times 
during an election campaign. Setting aside the possible 
sequencing effects of these “turns,” the key question 
concerns the substantive interpretation of what is go- 
ing on. KMP report results for systems with up to seven 
parties. In this context the generic counterfactual ques- 
tion posed by each ae in private polling would take 
the form, “Assuming all six other parties retain their 


8 A “random adaptive party” generates a finite set of counterfactual 
platforms in the neighborhood of its current platform and adopts 
the alternative attracting most support. A “climbing adaptive party” 
makes a sequence of small counterfactual modifications to its policy 
position, experimenting with individual small shifts, implementing 
any shift that increases support, and iterating this process. A “genetic 
adaptive party” uses a version of the genetic algorithm developed 
within evolutionary biology Its policy package ıs described by a 
vector of seven possible policy positions on 15 different issue di- 
mensions and treated as equivalent to 1ts “policy DNA.” A finite set 
of counterfactual policy vectors is generated and subjected to the 
genetic operators of reproduction, crossover, and mutation, with the 
resulting evolved positions adopted if they increase party support. 
This process is iterated a set number of trmes during a campaign 

? de Marchi (2003) subsequently refined assumptions about voters. 
“Highly sophisticated” voters have “constrained” issue positions— 
position on one dimension can be predicted from positon on 
another—and report their preferences to pollsters relatively accu- 
rately. Other voters display “less interest or aptitude m politics” 
(148). Ther positions on each issue are derived from indepen- 
dent uniform distributions and reported to pollsters with high van- 
ance around “true” positions Results confirm expectations that the 
greater the uncertainty with which opmion polls pick up the ideal 
points of “unsophisticated” voters, the more meffective party search 
strategies 
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current policy packages, and assuming we move our 
own policy package from x to x + ô, which party would 
you support?” A battery of similar questions generat- 
ing systematic sweeps of ô would need to be asked by 
each party at each cycle of adaptation in order to identify 
the best policy move from a given position. This seems 
to be a complicated and substantively unrealistic way 
for parties to gather information. Transplanting their 
search algorithms into a multiparty context, KMP may 
be assuming that party leaders are able to read far 
richer information off the system than would conceiv- 
able in any real-world context. Furthermore, it is not 
clear that multiparty competition is realistically mod- 
eled by giving each party a chance to adapt its policy 
in a well-sequenced series of “turns,” during each of 
which all other parties simply watch what is happen- 
ing. Actual policy moves may well be made in some 
kind of de facto sequence, but it seems implausible to 
imagine parties sequencing their information-gathering 
activity in this way. Yet this sequencing is mandated by 
the counterfactual questions to voters the KMP search 
algorithms need to ask; these algorithms may thus not 
adapt plausibly to a multiparty context. They require 
parties to have much less information about voters than 
the traditional spatial model, but impose what may still 
be unrealistic requirements on information-gathering 
activities by parties, as well as unrealistic sequencing 
assumptions. For these reasons, an alternative set of 
adaptive algorithms is explored below. 


A NEW AGENT-BASED MODEL OF 
MULTI-PARTY POLITICAL COMPETITION 


Motivational Assumptions 


Following both the traditional spatial model and KMP, 
I classify political agents into two types or “breeds,” 
party leaders and voters.” Voters are assumed to be 
intrinsically interested in policy and to have ideal points 
in a real policy space, the dimensions of which are 
directly analogous to those in the traditional spatial 
model.!! Again following the traditional model, party 
leaders are assumed, with one exception, to have only 
an instrumental interest in policy, nonetheless com- 
peting with each other by offering policy packages to 
voters. 


Party System Dynamics 


Initiation of the model, as with KMP, randomly scat- 
ters a discrete set of party positions and supporter 
ideal points across the policy space. Voters initially 
support the party with the policy position closest to 


10 Thus this is not a model of the purely endogenous evolution of 
political partes within a “single breed” population of voters—for an 
example of such a model see Schreiber 2002. 

11 To aid visualrzation, this version of the model is implemented ın 
two dimenmons, but the model can be implemented in any number 
of dimensions. 

12 The model currently uses a random draw from a normal distnibu- 
tion with a mean of zero and a standard deviation one third the radius 
of the space There is no reason why specific assumptions or data 
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their ideal point. Once voters have supported par- 
ties, party leaders adapt their policy positions to re- 
flect the pattern of party support by voters. They use 
one of several decision rules to do this (see below). 
Once party leaders have adapted their policy posi- 
tions, voters readapt and once more support the clos- 
est party, switching parties if this has changed. Par- 
ties now readapt to the new configuration of voter 
support and the process iterates continuously. Voters 
readapt their party support in the light of each new 
profile of party policy positions; party leaders readapt 
their policy positions in the light of each new profile 
of voter support. Once in motion, the process never 
stops. 

Thus this model does not have the “election cam- 
paigns” within which the action in the KMP model 
is confined. Political dynamics is seen as a continuous 
process that runs all the time, between elections as 
well as in the run-up to these. For those who want to 
know something about election campaigns, in addition 
to the continuous evolution of political competition, 
snapshots or short movies can be taken of this process 
at scheduled intervals, for systems with fixed-term leg- 
islatures, or at intervals generated by some probabilis- 
tic model of endogenous election scheduling that suits 
the purposes and tastes of the investigator. It is impor- 
tant, however, if we seek a realistic dynamic model of 
political competition, that the baseline model describes 
a process that never stops rather than something set in 
motion only during election campaigns. 


Adaptive Decision Rules for Party Leaders 


Party leaders are assumed to use one of a number 
of adaptive decision rules to set party policy at any 
given cycle of the process, These rules are grounded in 
received wisdoms about types of intraparty decision- 
making regime. The essential distinction is between 
what we might think of as a “democratic” party, in 
which leaders adapt party policy to the preferences of 
current supporters, and some form of “cadre” party, 
in which leaders adapt policy according to their own 
interests—for example, maximizing votes by appeal- 
ing to potential supporters. Kitschelt (1994) describes 
this distinction as one between “innovation from be- 
low,” in which a party seeks to “diversify its appeal 
and represent popular debates within the microcosm 
of the party,” and “‘innovation from above,’ whereby 
party leaders act autonomously from a party’s inter- 
nal process of interest aggregation” (212). The latter 
is “particularly important in situations where...small 
changes its strategic appeal my result in great differ- 
ences in its success in vote seeking...” (213). Müller 
and Strøm (1999) make the same distinction. They con- 
trast regimes, often within traditional social democratic 
parties, where “party members... constrain the party’s 


on party and supporter policy positions could not be read mto the 
model as starting configurations; this 1s indeed done below when the 
mode] ıs applied to a real party system. In this context, note KMP’s 
(1998) important finding that many key features of competition do 


not depend on the configuration of supporter ideal points 
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leading representatives to follow a clearly defined 
policy course” with regimes, as in the post-1979 PSOE 
in Spain, where members have “no influence on the 
policies of the party leadership” (292-93). 

The first adaptive rule for party leaders thus reflects a 
party in which the leader’s freedom to set policy is con- 
strained by the policy preferences of current party sup- 
porters. This rule—which incidentally cannot be found 
in the traditional spatial model, in KMP, or indeed in 
any other formal model of party competition of which I 
am aware—is AGGREGATOR. Under AGGREGA- 
TOR, leaders set party policy at the mean position on 
each dimension of the ideal points of current party 
supporters. They continuously readapt this position as 
the party’s support profile changes.» 

The second adaptive rule models an “unconstrained” 
party leader who constantly modifies party policy in 
the search for more supporters. Unlike the KMP search 
strategies that impose heavy information requirements 
in a multiparty context on leaders trying to evaluate 
prospective counterfactual policy moves, the rule de- 
fined here is intrinsically adaptive, conditioning the di- 
rection of a policy move at cycle c on the success or 
failure of the previous move, made at cycle c—1. The 
rule is called HUNTER and searches for support us- 
ing a simple Pavlovian “win-stay, lose—-shift” algon 
found very effective by Nowak and Sigmund (1993).! 
If the previous unit policy move increased party sup- 
port, measured using a common-knowledge published 
opinion poll of party support levels, HUNTER makes 
another unit move in the same direction. If the previ- 
ous move did not increase support, HUNTER makes 
a unit random move in the opposite direction, turning 
to face the opposite direction from the previous move 
and making a unit move in a direction chosen randomly 
within the half-space toward which it now faces. In 
addition to the success of this type of this adaptive rule 
in previous theoretical work, it does seem a substan- 
tively plausible way for a party leader to respond to 
the limited feedback provided by the cut and thrust of 
real politics. “If what I did the last time worked, do it 
again; if it didn’t work, back away and cast around for 
something new that does work.” 

An alternative adaptive rule for unconstrained party 
leaders is used here as an alternative to simple Pavlo- 
vian learning and exploits a different piece of freely 
available information party leaders might use to in- 
crease their support. This is the location of the largest 
party in the system and the rule is called PREDATOR. 
A PREDATOR observes the current sizes and policy 
positions of all parties at cycle c—1. If it was not the 
largest party, it makes a unit move at cycle c toward the 
position of the largest party. If it was already the largest 


B An alternative interpretation of AGGREGATE is a party that 
is a leaderless endogenous coalition of supporters, as envisaged by 
Schreiber (2002). 

14 A similar “win-stay, lose-shift” rule was plemented by Bendor, 
Dtermeier, and Ting (2003). 

15 HUNTER makes a unit move in a random direction on the first 
cycle of adaptation, since there m no previous move to evaluate. 


party, a PREDATOR stands still.!° Again, this seems 
a substantively plausible way for a real party leader 
to respond to a low-information environment. “I don’t 
know much about the precise locations of voter ideal 
points, but I do know more voters are located close to 
the largest party than are located close to me.” 

Partly to provide a static baseline against which 
adaptive decision rules can be evaluated, but also to 
model an ideological party leader concerned exclu- 
sively with maintaining a particular policy position and 
not at all with increasing party support, the final deci- 
sion rule used here is STICKER. A STICKER never 
changes policy position, regardless of the ideal points 
of voters and the positions of other parties. This rule 
is not itself interesting in dynamic terms but becomes 
significant when interacting with other decision rules. 

Figure 2 summarizes the model and its adaptive 
strategies, which was programmed by the author in 
NetLogo 2.0.2." As can be seen, once the setup phase 
is complete, the baseline model runs forever. If de- 
sired, however, party leaders and voters can be set up 
at predetermined locations and the model can be set to 
run for a predetermined number of cycles, to simulate 
the evolution of the system over a given period from a 
given starting point. 


BEHAVIOR OF SIMULATED PARTY 
SYSTEMS UNDER SINGLE DECISION RULES 


Hunter 


The most striking generic result in this paper is the 
success of the adaptive HUNTER algorithm in finding 
dense areas of party support. Hunters use no infor- 
mation whatsoever about the global geography of the 
policy space. They know only that voters support the 
closest party, how many supporters they had during 
the previous two cycles, and which direction they 
moved between these cycles. They have no knowledge 
of the ideal point of any voter, or of the position and size 
of any other party. Hunters use limited feedback from 
their local environment and apply it recursively, but this 
still allows them to pick up effective clues about the best 
policy direction in which to move. Figure 3 shows the 
trajectory of a typical simulated party system with three 
Hunters, each artificially started at the edges of a policy 
space with 1,000 randomly scattered voters. The plots 
show each party leader relentlessly homing in on higher 
support densities at the center of the policy space. It is 
important to remember when looking at Figure 3 that, 
unlike KMP’s Climbing Adaptive Parties, Hunters do 
not use an “uphill” search algorithm (backed up by 
sophisticated private polling about local sets of coun- 
terfactual policy positions) to find higher support densi- 
ties. They simply adapt to rewards and punishments re- 
ported in common-knowledge published opinion polls. 


16 In a two-party incumbent—challenger context, a predating chal- 
lenger would sunply move directly toward the known policy position 
of the incumbent, with no need to do private polling 

17 All programs are freely available upon request from the author. 
Key elements of the NetLogo code can be found in the Appendix 
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FIGURE 2. Overview of Dynamic Model and Its Adaptive Rules 


SET-UP 


Create v voters and p party leaders. Manually assign decision rules to party leaders 


Automatically assign 2-dimenslonal policy positions to all agents, drawn from random 
normal distrbuton with mean 0 and standard deviation 1/3 radius of space 


VOTERS ADAPT 
Ask voters to support the closest party. 


SYSTEM UPDATE 
Count voters supporting each party 
PARTY LEADERS ADAPT 
Ask leaders to use assigned decision rule to adapt party position 


ADAPTIVE DECISION RULES 


AGGREGATOR 
Go to mean postton of current party supporters on each dimension. 


HUNTER 
Was previous move followed by increased party support? If yes, repeat move 
Hf no, turn 180° from direction of last move, make unit move In direction 
randomly sefected from arc 90° elther side of direction now faced. 


PREDATOR 
Observe party sizes If you are the largest party, stand still 
If not the not largest party, set heading towards largest party, make unit move 


STICKER 
Never change pollcy position 


Large numbers of simulations, run with up to 12 parties, 
always showed the same pattern.!® HUNTER’s in- 
cremental adaptive moves thus converge in a low- 
information agent-based environment on the behavior 
that would be predicted by a traditional spatial model 


18 There is one crucial but obvious exception. In a one-party system 
with a lone Hunter, party support is unaffected by policy position, 
and HUNTER can take any position in the space. 
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assuming hyperrational strategic agents with perfect 
information about all voter ideal points. 

Figure 3 also hints at strong pattern observed in 
many simulations of party systems where all parties 
are Hunters. While Hunters systematically move toward 
the center of the policy space, they tend to avoid tts 
dead center. After any given cycle in a mature party 
system, the party configuration is typically a scatter of 
Hunters around, but definitely not at, the center. ‘This 
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FIGURE 3. Three Hunters Finding High Support Densttles 


is illustrated in Figure 4, which summarizes the results 
of an experiment involving 100 independent 500-cycle 
runs of a system with seven Hunters and 500 voters. 
To concentrate on the configuration of the “mature” 
system, the effects of the random starting configuration 
were “burnt off” by discarding output from the first 150 
cycles of each run, before output for a 500-cycle run 
was recorded. Figure 4 shows a histogram of distances 
between one of the Hunters and the origin, measured in 
standard deviation units of the voter distribution (SD 
units), during the full 50,000 cycles of the experiment.!° 
This picture tells a simple but striking story, showing 
how rare it is for Hunters to go to the dead center of 
the policy space. In this seven-party case, Hunters are 
typically found about 0.85 SD units from the center of 
the space. 

Table 1 shows results from a suite of experiments 
designed to characterize Hunter distances from the 
center of the policy space as a function of the number 
of parties in the system.” Just as a Hunter in a one- 
party system can roam anywhere in the policy space, 
two-Hunter systems are also highly atypical. The two 
Hunters go to the center of the space and attack each 


9 Entirely similar patterns obtain for all other Hunters in the simu- 
lation. 


® The seven-Hunter case in Table 1 is described in the previous 
paragraph. Experiments for party systems of other sizes were based 
on 10 trials of 1,000 cycles, with parties and 500 voters rescattered at 
the start of each trial and the first 150 cycles discarded as a burn-off 
phase. 





A Hunter 1 


+ Hunter 2 


at) v Hunter 3 


others’ support bases head to head—very much along 
the lines predicted for two-party competition by the 
traditional spatial model. With three or more Hunters, 
however, the Hunters tend to search for votes away 
from the dead center of the policy space. In a three- 
Hunter system, Table 1 shows that Hunters typically 
search for votes about 0.51 SD unit from the origin. 
Since the standard deviation of their distances from 
the origin is about 0.24 SD unit we see that, even in 
a three-Hunter system, Hunters only rarely go right 
to the origin. Typical Hunter distances from the origin 
increase with party system size, though in party systems 
with five or more Hunters, we can be confident that the 
Hunters will search for votes in a donut-shaped region, 
at a mean distance of between 0.75 and 0.95 SD unit 
from the origin. Standard deviations show that Hunters 
only rarely visit the origin, despite the fact this is the 
location of the highest vote densities. 

The behavior behind the patterns in Table 1 is eas- 
ily understood if the system is watched in motion. If 
any party goes to the center of a system with several 
Hunters, it becomes a rewarding source of votes for 
other Hunters, who move toward it and feed off its 
support base. The party at the center is thus punished 
by losing support, despite being at the highest density 
point in the space, and responds to this punishment by 
moving away from the center. This emergent “coordi- 
nated” behavior by the other Hunters arises despite 
the fact that, in contrast to more traditional game the- 
oretic models, no Hunter knows any other party ex- 
ists, showing the complex way in which several Hunter 
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FIGURE 4. Distribution of Hunter Distances from the Origin 
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Mean Hunter Distances from Origin In Two- to 10-Party Systems 


Distance from origin (SD Units) 


Mean SD 
20,000 0.19 0.11 
30,000 0.51 0.24 
0.68 0.25 
0.76 0.26 
0.79 0 27 
0.85 0.29 
0.88 0.30 
0.92 0.33 
0.93 0.34 


Note. Distances measured m terms of standard deviaton (SD) units of the voter distribution. 


algorithms interact. In effect, systematic information 
about the behavior of all other parties is encoded in 
the sequence of rewards and punishments experienced 
by a Hunter. One consequence of this is that a Hunter 
finding itself at the center of the party system tends to 
get out as fast as it can. In general substantive terms, of 
course, we note that it is empirically rare to find large 
parties at the dead center of multiparty systems—a 
theoretical problem for the traditional spatial model 
recently discussed by Schofield (2003). 


Aggregator 


While AGGREGATOR may not at first seem to be an 
intrinsically adaptive decision rule, it generates party 
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adaptation in the context of a dynamic multiparty sys- 
tem. A set of voters supports an Aggregator at cycle c 
because it is the closest party to them. The intrinsically 
“democratic” Aggregator goes to the mean policy po- 
sition of its supporters but all other parties adapt their 
positions at the same time. Following this, some voters 
may wish to switch parties, causing the set of voters 
supporting both losing and gaining parties to change. 
Each switch to or from an Aggregator, quite possibly 
arising from the actions of other parties, changes the 
set of voters whose views must be taken into account 
and thereby provokes a shift in its policy position.”" 


21 Since a voter located at the current party position would never 
switch. 
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This policy shift could easily provoke at least one other 
voter to switch, provoking other policy moves, pro- 
voking other party switches, and so on, ad infinitum. 
All-Aggregator party systems could thus move contin- 
uously, while any Aggregator adapts to vote switching 
generated by neighboring parties that use different de- 
cision rules. Votes lost to a rival on one side of a policy 
dimension, for example, provoke a policy adaptation 
in the other direction as the policy center of gravity of 
the party changes. 

One strong result from many simulations is that a 
steady state always emerges in a party system where 
all parties are Aggregators, an unanticipated and strik- 
ing pattern. We might conceivably devise pathological 
configurations of voter and party positions where the 
system is always in motion; but we can confidently 
conjecture that, for an arbitrary scatter of discrete 
voter ideal points and party positions, a steady state 
will be achieved after a finite number of cycles of an 
all-Aggregator party system. Kigure 5 illustrates this 
systematically. The top panel shows the pattern of evo- 
lution to steady state in a 10-Agpregator party system 
with 500 voters, summarizing results of an experiment 
with 200 independent 75-cycle trials, each with a ran- 
dom draw of party and voter starting positions. For any 
given cycle, it shows the proportion of trials for which 
the party system was still adapting. In about half of 
the simulated party systems, party support levels flat- 
lined in steady state before 20 cycles of the model; in 
this experiment, the longest adaptation to steady state 
from a random start took 43 cycles. 

We can conjecture that steady states emerge in 
all-Ageregator party systems, instead of the unend- 
ing adaptation and readaptation we might expect, be- 
cause any discrete scatter of voter ideal points has 
nonuniform random clusters of voters that provide lo- 
cal attractors for Aggregating parties. Such a discrete 
scatter has a “granular” quality, providing a source 
of friction in the system quite distinct from the con- 
ventional spatial model’s smooth density map of ideal 
points, which describes a friction-free world with an in- 
finite number of agents. The bottom panel in Figure 5 
throws systematic light on this, in a series of experi- 
ments (Models 1-3) on all-Aggregator systems, each 
with five parties but with from 250 to 1000 voters.~ As 
the number of voters decreases, so the granularity of 
any random scatter of voters must increase—reaching 
extreme granularity when there is just one voter. Each 
cell in the table shows the percentage of trials for which 
the party system was still in motion after a given num- 
ber of cycles. First, note that every one of the sim- 
ulations reaches steady state before 55 cycles have 
elapsed. Second, comparing results for Models 1-3 
supports the expectation that, the smaller the number 
of voters and the more granular the scatter of ideal 
points, the faster the party system reaches steady state. 
After 20 cycles of the system, 94% of 250-voter systems 
had flat-lined in steady state, compared to only 57% 
of 1,000-voter systems. Model 4 summarizes the 10- 
Aggregator experiment reported in the top panel in 


2 Rach experiment involved 200 independent 75-cycle runs. 
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Figure 5; comparing this with Model 2, we see that, in 
contrast to the granularity of the scatter of ideal points, 
the number of Aggregating parties appears not to make 
a difference to the speed of convergence to steady 
state. 

Such steady states, while quickly reached, are easy 
to perturb. If each voter is sent on a one-unit random 
walk after a steady state has emerged, for example, 
Aggregating parties readapt to different positions and 
sizes. This adds to the intuition that it is local and easily 
perturbed granularities in the voter distribution that 
attract particular steady states of party positions. Thus, 
while we confidently predict a steady state in any given 
all-Aggregator party system, it is impossible to pre- 
dict precise party locations in this. What we can confi- 
dently predict, however, is that Aggregating parties will 
spread themselves evenly over the surface of the policy 
space. Always located at the center of its supporters, 
a “democratic” Aggregating party never “attacks” the 
support base of any other party, but adapts passively 
to the changing composition of its support base by 
locating itself at the center of this. The result is an 
even distribution of parties across the surface of an 
all-Ageregator party system. 

All-Sticker and all-Predator party systems are triv- 
ial or pathological. Party leaders in all-Sticker systems 
by definition never move. In all-Predator systems, all 
agents end up writhing around on top of each other in 
the center of the space—a result not unlike that of the 
unreconstructed traditional spatial model. Neither of 
these looks remotely like a real party system, in which 
parties typically do not converge on the center, a prob- 
lem recently highlighted by Schofield (2003). 


COMPETITION BETWEEN PARTIES USING 
DIFFERENT DECISION RULES 


One benefit of using agent-based models of party com- 
petition is that it is easy to observe how different types 
of parties fare when competing against each other in 
the same policy space. There are far more configura- 
tions of four decision rules, assigned to parties in sys- 
tems of varying sizes, than it is conceivably possible to 
investigate here. I focus below on how Predator parties 
fare facing two different types of opposition, and on 
whether a lone “ideological” Sticker can succeed in 
pulling the mean policy of all other parties toward its 
preferred position. 


“Democratic” Aggregator versus 
Predator Partles 


A Predator scans the system for the largest party and 
moves relentlessly toward this, standing still only when 


B In both models this situation arises because all party leaders are 
programmed to do nothing but attack each other, which necessarily 
results in all party leaders ultumately finding themselves at the same 
part of the space. In effect both all-predator systems and the unre- 
constructed spatial model are prone to ignore potential sources of 
lost party support ansing from converging on the positions of other 
parties. 
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FIGURE 5. Emergent steady states In al-Aggregator party systems 
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are 
Each expenment involved 200 independent 75-cycle runs, partes and voters re-scattered 


it becomes the largest party itself, Repeated simula- 
tions show that a lone Predator always does well in a 
system otherwise populated by democratic Aggrega- 
tors. As might be expected analytically, a steady state 
quickly emerges in which Predator finds a (central) 
place where it is the largest party and Aggregators 
adapt to this, spacing themselves evenly around non- 
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central regions of the policy space. Pit two Predators 
in competition with a set of Aggregators, however, and 
the situation is quite different. Repeated simulations 
show that the Predators both move to the center, but 
if one ever becomes the largest party, the other attacks 
it. The Aggregators surround the Predators, boxing 
these into the center of the space. If the Predators 
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FIGURE 6. Party Sizes In a Three-Hunter, One-Predator Party System 
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move in one direction, some Aggregator in the other 
half of the space becomes the largest party; the Preda- 
tors reverse direction and move toward this?* Over- 
all, repeated simulations show that Aggregator par- 
ties are quite effective against two or more Predators. 
Indeed a more general conclusion can be noted in 
this context. Democratic Aggregator parties compet- 
ing with “unconstrained” party leaders (Hunters and 
Predators) rarely go to the center of the policy space, 
where Hunters and Predators tend to compete, and 
are typically not the largest party; but neither do they 
do very badly in such competitions. An Aggregator’s 
continuous adaptation to the center of its current set 
of supporters keeps it backing away from close contact 
with other parties, including those that attack its sup- 
port base. If the Aggregator’s support base erodes as 
a result, then its declining pool of supporters becomes 
less vulnerable to attack by party leaders using more 
aggressive search algorithms; the decline is thereby 
halted. 


Hunters versus Predators 


Simulations of party systems pitting Hunters against 
Predators generate thought-provoking results because 


4 This an improved and biologically more realistic Predator 
algorithm, in which the direction moved by the Predator would be 
a trade-off between the sizes of the other (prey) parties and their 
distance from the Predator. 


HUNTER 1 


HUNTER 2 HUNTER 3 


i S E 
Note Summary results from 100 independent 600-cycle tnals (l.e., overall n= 50,000), each tnal recorded after 160-cycle bum-off 
period. 


a lone Predator, contrary to what we might superfi- 
cially expect, proves ineffective against any number 
of Hunters greater than one.» Predator goes to the 
center of the space and the Hunters surround it, box- 
ing it in closely. Predator keeps moving toward the 
largest Hunter, but if it has any success, another Hunter 
quickly becomes largest and Predator changes direc- 
tion toward its new prey. It is uncanny, watching such 
simulations, to see several Hunting parties appearing 
to coordinate in surrounding the Predator. This emer- 
gent behavior arises despite the fact that each Hunter 
uses simple Pavlovian adaptation and has no clue there 
are other Hunters in the space. HUNTER’s random- 
reversed move in response to punishment is far more 
effective at getting out of a situation of falling support 
than PREDATOR’s more Cartesian approach, using 
superior knowledge of the geography of the system, 
to becoming the largest party. With more than one 
Predator, the situation deteriorates radically for the 
Predators because, should one Predator ever get to be 
the largest party (if Hunters temporarily drift apart, for 
example) another Predator immediately attacks it.” 
Figure 6 shows the results of an experiment with 
100 independent 500-cycle runs of a party system with 


B Competition between a lone Hunter and a lone Predator is patho- 
logical, but most amusing to watch m motion, having the structure 
of a Tom and Jerry cartoon. 

6 The biological analogy may not be helpful here. Predators of the 
same animal species tend not to canmibalize each other; the same 
should probably not be assumed for political parties 
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three Hunters and one Predator.”’ These box plots of 
party support levels show that the Hunters are sub- 
stantially more successful than the Predator. The lone 
Predator in this experiment was the largest of the four 
parties for 5% of all cycles and the smallest for 87% of 
cycles—a startlingly poor result given its prime target 
of becoming the largest party. 


Sticker versus Hunters 


The Sticker rule for setting party policy can be inter- 
preted as modeling an “ideological” party leader, con- 
cerned to promote a particular policy position rather 
than to maximize party support. This leaves open the 
substantive question of whether the leader’s unwilling- 
ness to compromise arises from an expressive desire to 
articulate a particular policy position or from a more 
instrumental desire to use party competition to pull 
the positions of other parties toward his or her ideal 
point. With one very important exception, however, 
simulation experiments using the model show that an 
ideological Sticker tends not to affect party competi- 
tion by dragging other parties toward it. The excep- 
tion, possibly underlying the gut feeling that being 
an intransigent Sticker may pull others toward you, 
arises in a two-party system when a Sticker competes 
with a Hunter. In this case, repeated simulations show 
that the Sticker’s position exerts a strong pull on the 
Hunter—which tends to seek votes close to the posi- 
tion of the Sticker. This is because the Hunter is most 
rewarded at positions closer to the center of the space 
than the Sticker, yet as close as possible to the position 
of the Sticker.* In this case the Sticker’s ideological 
intransigence does pull the lone Hunter toward it and 
is, in this way, instrumentally rewarded. 

The grave dangers of generalizing from two- to mul- 
tiparty competition, and the need for a dynamic model, 
can be seen clearly when we consider competition be- 
tween Hunters and Stickers. When there is more than 
one Hunter in the party system, interaction between 
the Hunters has a far greater effect on party com- 
petition than the policy position of a single intransi- 
gent Sticker. This was explored systematically in an 
experiment involving 200 independent 250-cycle runs 
of a system with three Hunters and one Sticker.” 
Sticker positions were rescattered each run, show- 
ing the impact on Hunters of different locations of 
the Sticker’s policy position. The experiment clearly 
showed that the unchanging position of the Sticker 
does not drag other party positions systematically to- 
ward it. Watching the simulations in motion, the sub- 
stantive reason for this is clear. If a Hunter strays to- 
ward the position of a Sticker located away from the 
center of the space, then this is quickly exploited by 


77 Output from each run was recorded after a 150-cycle burn-off 
od. 
The logic here thus resembles that of the comparative statics in 
the traditional spatial model 
2 The first 150 cycles of each run were discarded as the burn-off 
phase, with output from 250 cycles recorded for each of the 200 
runs—giving 50,000 cycles ın all 
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other Hunters; the straying Hunter is thus punished 
by losing support and reverses its direction of move- 
ment. Indeed the interaction of the simple Hunting 
algorithms creates a multiparty system of considerable 
complexity and yields results that seem substantively 
plausible yet quite distinct from those generated by 
either two-party competition or a static model. 

Figure 7 presents further results from this experi- 
ment that throw general light on the effectiveness of 
simple adaptive rules for setting party policy. It plots, 
for the Sticker and one of the Hunters, party support 
levels against the party x-coordinate throughout the 
full 50,000 cycles of the experiment.” This shows that 
support for the ideological Sticker is almost invariably 
less than support for the Hunter at precisely the same 
position on the x-dimension. The Hunter sets its x- 
coordinate in an adaptive response to the evolution of 
the system, whereas the same x-coordinate set by the 
Sticker is an inflexible nonresponse to this. We thereby 
see the need to distinguish between a party policy po- 
sition set as a result of adaptation to the evolution of 
the system and a position set without regard to this 
evolution. The two types of party tend to enjoy very 
different levels of support, even when they take iden- 
tical policy positions on the same dimension. A party’s 
support level depends not just on its position on some 
policy dimension, but also on whether it arrived at this 
position as a result of adapting to party competition or 
because it has an inflexible policy program. 


THE DYNAMICS OF PARTY COMPETITION 
IN IRELAND 


Scott Moss (2001), evaluating his agent-based model of 
the behavior of market intermediaries, argues model 
outputs should have the same statistical distributions 
as observed outputs in the social system being mod- 
eled, describing a theoretically relevant statistical dis- 
tribution of outputs as a “statistical signature” of the 
system. This offers a way to calibrate simulations to 
observations of the real world and, thereby, evaluate 
the empirical relevance of different models. Observing 
actual party competition, there is no chance of gather- 
ing dense time series data on small shifts in party policy 
positions and, thereby, reading this statistical signature 
in the real world. There are dense time series of pub- 
lished opinion polls tracking changes in party support 
over time. Traditional spatial models have little to say 
about either cross-sectional or time series variations in 
support for different parties, but those with an interest 
in real party competition know that these two types of 
variation are substantively significant yet observable 
features of areal party system in motion. These features 
of opinion poll time series are thus attractive moving 
targets for a dynamic model to hit; they are more at- 
tractive in many ways than real election results, which 
are availablé only at infrequent intervals and omit a lot 
of intervening dynamics. 


30 The curved lines are quadratic regressions summarming the scatter 
of points for each party. 


American Political Science Review 


Vol. 99, No. 2 


FIGURE 7. Hunter and Sticker Support levels for the Same x-Coordinate 
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In what follows, the moving target is the quarterly 
opinion poll series plotted in Figure 1, describing the 
dynamics of five-party competition in Ireland between 
1986 and 1997.7! To determine whether or not the tar- 
get has been hit, we need measures of substantive “fit” 
between simulated and real data. To measure the ex- 
tent to which simulations “fit” the mean size of each 
party, and thus the “shape” of the party system, I use the 
mean absolute difference (MAD) between simulated 
mean party Sizes and their mean sizes in the opinion 
poll series.“ To measure the extent to which simula- 
tions fit variations over the time in the size of each 
party, and thus the “volatility” of the party system, I 
use the MAD of standard deviations in party sizes over 
the simulated series from their standard deviations in 
sizes in the opinion polls. To get a sense of orders of 
magnitude for these measures of fit, the MAD between 
mean party sizes in published opinion polls and those 
in real elections over the same period was 2.3%. The fit 
between standard deviations of party sizes in opinion 
polls and those in real elections was 0.8%. Finally, to 
provide a more intuitive measure of the extent sim- 
ulations are capturing the substantive “shape” of the 
party system, I calculate the widely used measure of 
the “effective number of parties” (ENP). This gives 


31 Data made available by Michael Marsh. Coakley and Gallagher 
(2004) provide recent general discussions of Irish party competition 
Insh elections held in this period are discussed, ly, in Laver, 
Mar and Sinnott 1987; Gallagher and Sinnott 1990; Gallagher and 
Laver 1993, and Marsh and Mitchell 1999 

X That is, the mean over the five parties of the difference between 
mean simulated party size and mean opmion poll size. The measure 
for standard deviations 1s analogous 





a feel for the extent to which electoral support is 
concentrated on a small number of larger parties 
(Laakso and Taagepera 1979).* 

The first two columns of values in Table 2 give mean 
support levels for each party in both opinion polls and 
real elections, and standard deviations of these, over 
the period under investigation.* They also summarize 
the cross sectional variation in party support as the 
effective number of parties. The mean ENP for the 
five-party system described in this opinion poll series 
was 2.9, reflecting the most crucial substantive feature 
of Irish party politics; it is dominated by its two largest 
parties—Fine Gael and, especially, Fianna Fail. 

To simulate the dynamics of Irish party competition, 
we need a spatial representation of the policy positions 
of parties and voters. This was derived from expert 
survey estimates by Laver and Hunt (1992) relating 
to 1988-89, the closest to the start of simulated “real- 
ity” for which data are available. The simulations re- 
ported below use two policy dimensions. The first is the 
left-right dimension of economic policy found by 
Laver and Hunt to pervade party competition in 
Western Europe.” The second is the noneconomic 
policy dimension with the highest weighted mean 


B The ENP turns out to be a simple mathematical transformation of 
the population standard deviation of party sizes at a grven cycle. It 1s 
not a surtable measure of fit, 1f the sexes of two parties are inverted, 
and the simulation completely “wrong” in this sense, the ENP 1s 
unchanged 

H The percentages reported are each party’s share of the five-party 
vote, excluding votes for independent candidates and microparties 
3 The Laver-Hunt dimension trades off lower taxation against 
higher public spending 
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TABLE 2. “Real” and Simulated Party Support In Ireland, 1986—89 


Target 


Opinion Polls Elections 


51.0 46.1 
3.3 1.9 


25.4 30.0 
2.9 2.5 


11.9 12.8 
4.8 6.2 


8.2 73 
4.1 36 


3.5 3.8 
1.4 1.2 


2.3 
0.8 
2.9 3.0 


1: HHSSS 


Model 
2° HHSHH 3° HHSSH 


4: HHSHS 


48.4 48.9 48.5 
4.2 4.3 4.3 


21.6 245 22 6 
4.1 36 4.0 


18.2 18.0 19.3 
3.8 3.9 3.0 


8.2 4.4 
4.0 1.1 


3.6 4.1 
2.4 2.9 


2.6 
0.8 
3.0 


Note: Models are described as rule combinations In the order FF, FG, Lab, PFD, WP/DL A —Agoieontor. H—Hunter; S—Sticker 
Party means and standard deviattons are means over runs of the mean and standard dewation relevant party size for each run. 


salience in Ireland; this concerns attitudes on a con- 
tinued British presence in Northern Ireland. Laver 
and Hunt estimated positions of both party lead- 
ers and party voters on these dimensions, allowing 
an estimation of the policy position on each dimen- 
sion of the mean member of the voting population.* 
This in turn allows us to position each party policy 
position relative to the position of the mean voter, 
rescaling from the Laver-Hunt metric to the met- 
ric used by the simulations.” Simulated voters were 
randomly drawn from a normal distribution around 
the mean voter position, with the extremes of simu- 
lated policy dimensions scaled to three standard de- 
viations of the voter distribution on either side of this. 
It would be unrealistic to describe Irish party pol- 
itics as competition among five parties all using the 
same rule to set policy. Laver and Hunt (1992) asked 
experts about the extent to which each party sacri- 
fices policy objectives to get into office (or vice versa). 
The two large parties, FF and FG, were unequivo- 
cally judged office-oriented; WP/DL was unequivocally 
judged policy-oriented. The other two parties, Labour 
and PDs, were placed somewhere in between, though 
both parties publicly insist they are policy-driven. Thus, 


36 This was the mean position of each party’s voters, weighted by the 
share of the ftve-party vote won by the relevant party in the 1989 
election. 

37 The resulting party positions were as follows: 


poy Economic L-R Northern Ireland 


FG 108 —12.7 
LAB —21 1 —217 
PD —202 —76 
WP/DL —29.3 —28.7 


216 





as a “best estimate” using independent data, the three 
smaller parties are characterized as policy Stickers, 
with the other two, FF and FG, characterized as vote- 
seeking Hunters. Below, I assess these assumptions 
using simulation experiments to conduct systematic a 
sweep of rule assignments. 

Each experiment reported in one of the columns in 
Table 2 was based on 500 independent 43-cycle runs, 
using starting configurations of party and voter posi- 
tions derived from the Laver-Hunt data. Simulation 
results are compared with the 43-cycle “real-world” 
run generated by the opinion poll series. Different sim- 
ulation runs produce different results, even from the 
same starting configuration. One reason for this is that 
Hunting parties make random moves when punished 
with lower support; another is the random draw of 
voter ideal points. To assess the robustness of simula- 
tions to these random effects, two experiments were 
conducted using the “best estimate” rule assignment. 
The first involved repeated runs with the same random 
draw of voter ideals, confining the variation to random 
moves by Hunting parties. The second involved re- 
peated runs with different random draws of voter ideal 
points. The results showed that it made little difference 
whether or not voters were rescattered each run; the 
precise scatter of voter positions had little impact on 
party sizes in the simulations. The simulations below 
thus rescattered voters each run, removing the possi- 
bility that results depend on some particular scatter of 
voters. 

Model 1 in Table 2 uses the “best estimate” assign- 
ment of decision rules to parties. Mean support shares 
of the two largest parties, FF and FG, were simulated 
as 49.2% and 25.9% respectively, as opposed to 51.0% 
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and 25.4% in the opinion polls. Standard deviations of 
the FF and FG support shares over the simulated time 
series are also similar to those of the opinion poll series. 
Labour support share was overestimated by 7.3% in the 
simulation and PD share underestimated by 3.6%; time 
series variations in PD support were lower in simula- 
tions than the opinion poll series. The mean absolute 
difference between party support in opinion polls and 
“best estimate” Model 1 was 3.1%, as compared to the 
2.3% fit between real election results and opinion polls. 
An important baseline for assessing this is the MAD fit 
to the opinion poll series of a simulated static party sys- 
tem in which no party adapts its policy position; this was 
4.9%. Looking at time series variation in party sizes, 
the MAD of the best estimate model is 0.8%, exactly 
the same as the MAD between-party size variations in 
real election results and opinion polls. This compares 
with a MAD between variations in party sizes for the 
static p system and those in the opinion poll se- 
ries of 3.3.” At 2.8, the simulated effective number of 
parties is indistinguishable from that in real opinion 
polls. Overall, “best estimate” Model 1—FF and FG 
Hunters, other parties Stickers—fits the opinion poll 
series almost as well as real election results and sub- 
stantially better than the static model. This is certainly 
an encouraging start. 

Model 1’s assignment of party decision rules is both 
supported by independent expert survey data and 
substantively plausible, but it is important to know 
the sensitivity of simulation results to different para- 
meter settings. Since starting policy positions de- 
rive from independent data, the main parameters in 
this model are decision rule assignments to parties. 
Confining the investigation to three decision rules— 
Aggregator, Hunter, and Sticker—there are 3° (=243) 
permutations of rules among the five parties. A system- 
atic sweep of all permutations was conducted, with a 
simulation experiment for each permutation that was 
identical to that described for Model 1.” The results 
of this sweep are summarized in Figure 8. Our objec- 
tive is to assess models according to their ability to 
retrieve both the mean sizes and the variation over 
time in these sizes of the Irish parties over the pe- 
riod. Thus the horizontal axis of each plot shows the 
mean absolute deviation fit of party sizes generated 
by each model; the vertical axis shows for each model 
the MAD fit of standard deviations of party sizes over 
time. A model with the same signature as the opinion 


Æ The fit to a static system was estimated using a 500-trial experiment 
directly comparable to those reported in Table 2, in which all parties 
were Stickers. 

* Obviously, the variation in party sizes in the static model is zero. 
To reduce the 240 hours of continuous computing that would have 
been needed to conduct this sweep, JAVA code (available from the 
author) was written to control the NetLogo program Each exper 
iment first did 50 runs of 43 cycles. If the MAD from opinion poll 
party sizes was less than 50 after 50 runs—taking the 49 MAD 
for the static model as the benchmark—the experiment continued 
for the full 500 cycles; otherwise ıt was termimated after 50 cycles 
Given the substantive plaumbihty, backed by expert survey evidence, 
of demgnating both FF and FG as Hunters, all such rule combinations 
were subjected to a 500-run experiment. Thus all data in Table 2 
derive from 500-run experiments. 
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poll series would have zero MADs on both measures. 
Thus, the closer the position of a model to the origin 
of this “fit” plot, the better. The top panel in Figure 8 
plots these measures for the sweep of all 243 possible 
rule permutations. To give a sense of how to evalu- 
ate this information, the horizontal and vertical lines 
dividing the plot into segments show the fit on both 
criteria of the static (SSSSS) model applied to the Irish 
case. Simulations in the bottom-left segment of the plot 
thus performed better than the static model on both 
fit criteria. The bottom panel in Figure 8 enlarges the 
bottom-left area of the main plot, where the “best fit” 
models are found. This shows that our “best estimate” 
Model 1 (HHSSS) is among the four best-fitting mod- 
els in the sweep. There are three rule combinations 
that perform somewhat better. One rule combination, 
in particular (HHSHH)—all parties Hunters except 
Labour—performed better on both measures of sub- 
stantive fit. 

Table 2 reports substantive results for the four best- 
fitting models identified in Figure 8.*! It is both strik- 
ing and substantively plausible that all four best-fitting 
models make FF and FG Hunters, with Labour a 
Sticker, permuting Hunter-Sticker combinations be- 
tween PDs and WP/DL (all best-fitting rule combina- 
tions take the form HHSxx). Comparing these models, 
we see that improvements in the fit of party sizes are 
achieved by designating the PDs or WP/DL as Hunters 
rather than Stickers, holding other rules constant, with 
the overall best fit (HHSHH) achieved by making both 
PDs and WP/DL Hunters. Substantively this is because 
simulated PD and WP/DL sizes are then closer to 
those in the opinion polls. Thus, despite their public 
protestations that they are policy-driven, the simula- 
tions show both parties prospering under a Hunting 
behavior that causes their relatively extreme starting 
policy positions to become more central.** It is also 
substantively significant that any rule combination al- 
lowing Labour to Hunt or Aggregate increases what is 
already an overestimate of party support. This goes to 
the heart of another distinctive feature of Irish politics, 
which is that the main social democratic party, Labour, 
is relatively small in European terms. Simulations using 
the model strongly suggest that Labour’s small size is 
the result of ideological Sticking, and that the party 
would systematically have won more support during 
this period if it had gone Hunting for votes closer to the 
center. Overall, this “decision rule sweep” focuses our 
attention on decision rule combinations that make FF 
and FG Hunters and Labour a Sticker. The best-fitting 
models all “fix” these rule assignments and permute 
the Hunter and Sticker rules between the two small 
parties. 

Theoretically, the large set of simulation experi- 
ments reported above suggests that the model can use 


41 An independent set of 500-run experiments for Models 1—4 was 


run on a different computer platform, with results identical to those 
in Table 2, which thus seem robust and stable. 

“ Shortly after the end of the simulated period, WP/DL did ın fact 
fuse with the Labour Party—unequivocal evidence of a shift toward 
the center 


Policy and Dynamics of Political Competition May 2005 





FIGURE 8. Results of the Systematic Sweep of All Rule Permutations 
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independent estimates of key parameter values and mates of party and voter positions, and party decision 
Tule combinations to simulate “realistic-looking” dy- rules, the model generates time series of individual 
namics in an actual party system—dynamics with sta- party sizes, variations in these, and the cross-sectional 
tistical signatures very similar to those of published variation of sizes between parties that look similar 
opinion poll series. Starting from expert survey esti- to published opinion poll series forming the basis of 
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substantive descriptions of the Irish party system. Sys- 
tematic sweeping of decision rule permutations adds 
insights to our evaluation of the best-estimate model, 
in this case plausibly “fixing” the strategies of three 
of the five parties and suggesting that a decision rule 
allocation running counter to the public protestations 
of the parties may generate a more realistic time series 
of party support. All in all, this is an encouraging first 
start at calibrating the dynamic model to a real party 
system. 


CONCLUSIONS 


The Story So Far 


The most striking generic finding reported above is the 
success of the simple Pavlovian HUNTER rule, which 
not only locates high voter densities in a very low- 
information environment but also outperforms other 
decision rules. The emergent “coordinated” behavior 
of a group of Hunters confronting a Predator is in- 
triguing; while Hunters are not explicitly aware of each 
other, the rewards and punishments to which they re- 
spond do convey information about other agents. As 
a result an adaptive Hunter, responding to such re- 
wards and punishments, is more than a match for a 
more “rational” Predator. Analysis of Hunter-Sticker 
competition shows that Hunters win more support than 
Stickers for the same policy positions, because Hunters 
set their positions as an adaptive response to the dy- 
namics of party competition and Stickers do not. In 
simulation results not reported above, Hunters also do 
well in competition with Aggregators, tending to hunt 
for votes in higher-density areas of the space, while 
Aggregators adapt their positions into the areas some 
distance away from this. Not only does the adaptive 
HUNTER algorithm—“If it ain’t broke don’t fix it; if 
it bites you walk away”—work well in generic simula- 
tions, but also it does seem an intuitively plausible way 
to describe a potential rule of thumb for party leaders 
forced to make policy decisions when they have only 
crude summaries of information about the precise ideal 
points of voters. In simulations of a real party system, 
plausible assumptions about which parties are Hunters 
characterize all simulations that generate a statistical 
signature close to that of the real world. All of this 
draws attention to the HUNTER algorithm as a way 
to motivate future dynamic models of adaptive mul- 
tiparty competition in an environment of low infor- 
mation. 

The “democratic”? AGGEGATOR algorithm sets 
out to reflect the views of a party’s current supporters 
rather than to increase the number of those supporters; 
we should thus not be surprised that Aggregating par- 
ties tend to do less well at winning votes than Hunting 
parties. Nonetheless, while rarely kings of the jungle, 
Aggregating party leaders do tend to keep out of se- 
rious trouble, since continuously adapting party policy 
to the center of supporters’ policy positions prevents 
the Aggregators from close contact, and hence poten- 
tial damage, from other parties. Methodologically, the 


emergence of steady states in all-Aggregator party sys- 
tem draws our attention to the impact of treating sup- 
porters as a set of discrete adaptive agents rather than 
summarizing their ideal points as continuous smooth 
density maps. 

The fact that the model can be deployed in a plau- 
sible way to retrieve a real opinion poll series in Ire- 
land also gives considerable grounds for optimism. Us- 
ing independent data on party positions and decision 
rules, the model can generate time series that capture 
mean party sizes, as well-as time series variations in 
these, that come as close to published opinion polls 
as real election results and closer than output from a 
static model assuming that parties do not adapt policy 
positions at all. Systematic sweeps of potential rule 
combinations add insight to our ability to understand 
the real party system under investigation. In the Irish 
case they confirm expert judgments of FF and FG as 
vote Hunters; imply that Labour must indeed have 
been a policy Sticker to garner so little support from 
its starting policy position; and suggest, despite public 
protestations to the contrary, that the two smaller par- 
ties may be vote Hunters rather than policy Stickers. 
This gives us some hope that we may be able to use 
the model to go beyond inherently unreliable pub- 
lic statements by parties about their political strate- 
gies and infer strategies that better fit their observed 
behavior. 


The Way Forward 


Significant extensions to the basic model have already 
been implemented and are briefly noted below; space 
constraints mean future papers will report results de- 
rived from these. In the model described above, voters 
always support the closest party, sometimes making 
“hair-trigger” switching decisions, This can be modi- 
fied if voters change party subject to some “switching 
threshold,” shifting party support only if the difference 
in distances between their current closest party and 
their existing party exceeds this threshold. This cap- 
tures voters’ inability to perceive small differences in 
policy distances or the nonpolicy attractions of their 
current party, for example, party identification.* In 
a stochastic extension of the model, voters switch 
party support with a probability that is a function 
of a “policy gain” from switching parties, a switch- 
ing threshold, and a “switching sensitivity” reflecting 
the voter’s general propensity to switch. It is clearly 
substantively desirable to factor both the nonpolicy 
attractions of particular parties and voter inertia in 
party switching into the future development of the 
model. 

Future work will endogenize many model parame- 
ters now set exogenously. A second-generation model 
includes the endogenous birth of new parties.“ If an 


® Jackson (2003, 130) uses an analogous threshold, in a stochastic 
environment, in his computational model of party competition, to 
describe the minimum threshold above which voters perceive a policy 
move to convey information 

M This model has already been programmed. 
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existing supporter becomes dissatisfied because exist- 
ing party policy positions are “too far” away from his 
or her ideal point in some parameterized sense, he or 
she may, with some probability, change state from sup- 
porter to party leader, in effect founding a new party. 
The birth and subsequent success of new parties is a 
feature of real party systems that represents another 
weakness in many theoretical models of party com- 
petition. It is surely more realistic to model this as a 
change of state of some political actor already within the 
political system—an approach intrinsic to agent based 
modeling—than as the “entry” of a hypothetical out- 
sider from some metaphysical political sideline. Future 
work will also allow party decision rules to evolve en- 
dogenously since we do know that changes in these 
rules—for example, reforms of decision-making within 
the British Labour Party—can transform a party’s elec- 
toral prospects. The locations of supporter ideal points 
vis-a-vis parties will also be endogenized as a func- 
tion of the policy positions of all other agents, whether 
voters or party leaders, during the recent history of 
the system. For example, they might shift away from 
policy positions that leave them insolated from other 
voters with whom they interact or toward the party 
they currently support. This represents perhaps one of 
the largest substantive prizes for a dynamic model of 
party competition. Almost all observers of real poli- 
tics believe voter preferences to evolve dynamically in 
response to the development of political competition, 
yet static models of party competition find this type of 
feedback difficult to handle. A further development of 
the model, currently in progress, is to move it into a 
“multilevel” setting. In national politics, what nearly 
always actually happens is that party policy is set na- 
tionally, but elections are fought and won in individual 
constituencies—each of them biased samples of the 
national electorate. Another striking multilevel elec- 
toral setting is the election, on the basis of effectively 
independent national competitions, of members of the 
multinational European Parliament. The complex in- 
teractions involved in such multilevel competition are 
well within the scope of the type of agent-based model 
described in this paper. 

Perhaps the biggest lesson for the future is that it is 
feasible using the techniques of agent-based modeling 
not only to describe, but also to implement and explore, 
a model that sets policy-driven party competition in 
the endogenously evolving dynamic environment that 
most informed observers agree is a plausible way to 
describe real politics. We must make sacrifices to do 
this in a tractable way, of course. Using agent-based 
models means that we must set on one side any inves- 
tigation of the sophisticated, forward-thinking strate- 
gic calculations at the heart of many game theoretic 
models. Instead we characterize party competition as 
adaptive learning by party leaders in a complex system 
with very limited information feedback. The results 
reported above are intended to demonstrate that it 
is feasible and realistic to model this complex system 
using currently available methods and that, in so doing, 
we can advance the analysis of political competition in 
interesting and important ways. 
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APPENDIX: SIMPLIFIED NETLOGO CODE 
FOR KEY FEATURES OF THE 
AGENT-BASED MODEL 


SUPPORTER BEHAVIOR 


to join-closest-party 
set closest-party min-one-of parties [distance-nowrap 
myself] 
end 
„supporter finds ID of the closest party 


PARTY LEADER BEHAVIORS 


to update 
set old-mysize mysize 
;, store party size from previous cycle 
ask supporters [join-closest-party] 
;; ask supporters to reconsider affiliation, possibly 
changing party 
set mysize count supporters with [closest-party = myself] 
;, calculate new party size 
end 


to reverse-browse 
set heading heading + 90 + random 180 
fd 1 
end 
„party turns about face, moves ın random direction 
away from previous heading 


to aggregate 
set xcor (sum values-from supporters with [closest- 
party = myself] [xcor])/(count supporters with [closest- 
party = myself]) 
set ycor (sum values-from supporters with [closest- 
party = myself] [ycor])/(count supporters with [closest- 
party = myself]) 
update 
end 
„aggregators set policy on each dimension at mean 
position of current party members 


to hunt 
ifelse (mysize > old-mysize) [fd 1] [reverse-browse] 
update 
end 
hunters move in same direction as previous move if 
this increased party support, 
else turn about face and make random move in 
opposite direction 


to predate 

set largest-party max-one-of parties [mysize] 

if (mysize <value-from largest-party [mysize]) [set 
heading towards largest-party fd 1] 

update 
end 

predator identifies largest party and moves towards 

it if predator is not largest 


to adapt 
if (my-strategy = “sticker”) [update] 
if (my-strategy = “ageregator”) [aggregate] 
if (my-strategy = “hunter”) [hunt] 
if (my-strategy = “predator”) [predate] 
end 
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SYSTEM 

to go 
ask parties [adapt] 
set cycle cycle + 1 


end 
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Do Get-Out-the-Vote Calls Reduce Turnout? The Importance 
of Statistical Methods for Field Experiments 


KOSUKE IMAI Princeton University 


n their landmark study of a field experiment, Gerber and Green (2000) found that get-out-the-vote 

calls reduce turnout by five percentage points. In this article, I introduce statistical methods that can 

uncover discrepancies between experimental design and actual implementation. The application of 
this methodology shows that Gerber and Green’s negative finding is caused by inadvertent deviations 
from their stated experimental protocol. The initial discovery led to revisions of the original data by the 
authors and retraction of the numerical results in their article. Analysis of their revised data, however, 
reveals new systematic patterns of implementation errors. Indeed, treatment assignments of the revised 
data appear to be even less randomized than before their corrections. To adjust for these problems, 
I employ a more appropriate statistical method and demonstrate that telephone canvassing increases 
turnout by five percentage points. This article demonstrates how statistical methods can find and correct 


complications of field experiments. 


of democratic elections. In the 2000 general 

election, for example, the Democratic and Re- 
publican parties spent an estimated $100 million on 
such efforts urging likely supporters to vote (Dao 
2000). Not only do political parties engage in strategic 
mobilization of targeted voters, but also many public in- 
terest groups make nonpartisan appeals. In particular, 
telephone canvassing has been one of the most widely 
used voter mobilization strategies. Yet in their land- 
mark study of a field experiment, Gerber and Green 
(2000) found that phone calls encouraging people to 
vote reduce turnout by five percentage points on av- 
erage. Indeed, their experiment implies that among 
single-voter households, phone calls reduce turnout by 
27 percentage points. Gerber and Green (2000, 660) 
describe the negative effect of get-out-the-vote calls 
as “one of the most surprising results to emerge from 
our experiment.” Not only does this finding go against 
the conventional wisdom in the literature, but also it 
throws into question why so many millions of dollars 
are spent on telephone canvassing for every election. 


Vu mobilization campaigns are a central part 
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http //GKing.Harvard.Edu/MatchIt. 
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In this article, I introduce statistical methods into 
political science research that enable us to uncover the 
discrepancies between designed experimental proto- 
cols and actual implementation. The same methods 
can be used to analyze nonexperimental data, where 
deviations from randomization are to be expected. Ap- 
plication of this methodology to Gerber and Green’s 
data shows that the negative finding about telephone 
canvassing originates from errors that occurred dur- 
ing implementation of the experiment. These errors 
resulted in the failure of randomization that would 
have been difficult to detect (and indeed were not 
detected) without my methods. For example, among 
single-voter households, those individuals who did not 
vote in the last election were more likely to be assigned 
phone calls. A statistical test I introduce shows that 
under the procedure specified in their original article, 
the pattern of incomplete randomization observed in 
the data would only occur with a probability of about 
one in 300 million. This and other implementation fail- 
ures contributed to the highly implausible result that 
get-out-the-vote calls decrease turnout by 27 percent- 
age points among single-voter households. Moreover, 
Gerber and Green’s article used incorrect treatment 
and control groups in their analysis. Since the estima- 
tion of causal quantities necessarily involves the com- 
parison of these two groups, their reported estimates 
turn out to be incorrect. 

In order to correct these problems, I apply a more 
appropriate statistical method, propensity score match- 
ing, that has become standard in other fields when es- 
timating the causal effects of nonrandom treatments 
(see Horiuchi, Imai, and Taniguchi (2005) for an ex- 
ample of analyzing a field experiment with completely 
randomized treatment assignment). The main advan- 
tage of matching is that it does not require restrictive 
functional form assumptions common to usual regres- 
sion analysis (see Ho, Imai, King, and Stuart 2004). 
This method literally matches each observation in the 
treatment group (e.g., those receiving phone calls) with 
observations in the control group (e.g., those not re- 
ceiving phone calls) whose observed characteristics are 
otherwise similar. The method, thus, constructs control 
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and treatment groups that are systematically different 
only with respect to whether they received treatment. 
The propensity score facilitates the use of matching 
in multivariate settings where one needs to match on 
many variables (Rosenbaum and Rubin 1983). 

The results of this analysis reverse Gerber and 
Green’s finding to show that get-out-the-vote calls 
increase turnout by about five percentage points on 
average. This result is consistent with previous exper- 
imental studies on the topic (e.g., Adams and Smith 
1980, Eldersveld 1956, and Miller, Bositis, and Baer 
1981), all of which found that such calls increase voter 
turnout. Moreover, it corroborates the evidence from 
a subsequent field experiment by the same authors 
(Green and Gerber 2001). 

Despite the clear evidence in their data, Gerber and 
Green (2000) are careful not to overstate the nega- 
tive impact of phone calls! They do, however, declare 
phone calls to be ineffective. Moreover, in another ar- 
ticle, Gerber and Green (2001, 80) offer an explanation 
for the negative impact of phone calls, saying that “it is 
conceivable that the phone call irritated some people 
and made them slightly less likely to vote.”* The policy 
implication of their finding is that money should not be 
spent on telephone canvassing. 

I also demonstrate that Gerber and Green (2000) 
may have been too quick to dismiss the utility of mail- 
ings. The authors assumed that everyone who was sent 
postcards received and read them. As a consequence, 
when assessing the relative cost effectiveness of post- 
cards, Gerber and Green incorrectly compared the ef- 
fect of sending postcards with the effect of visits on 
those who were home and talked with the canvasser. 
I show that once the appropriate comparison is made, 
mailing postcards can represent a cost-effective alter- 
native to sending canvassers directly to households. 

By finding and correcting the errors of Gerber 
and Green’s study, therefore, this article makes two 
methodological contributions that are relevant to field 
experiments in general: introduce statistical methods 
that enable us to find problems in experimental de- 
signs, and illustrate how statistical methods can correct 
problems like these wherever they exist. 


Statistical Methods Are Essentlai for Fleld 
Experiments 


The methodological issues that arose in Gerber and 
Green’s experiment have important implications for 
field experiments in general. Long after the first such 
experiment was conducted by Gosnell (1927), field 
experiments have recently become an increasingly 


1 Gerber and Green (2000, 660) write, “Given our initial expectation 
that telephoning increases turnout, we take this [negative] result to 
mean that the null hypothesis of no effect cannot be rejected using a 
one-tailed test ” 

2 This study is based on a field expenment that Gerber and Green 
conducted in West Haven at the same tume as the New Haven study 
See the section Implementation Errors and Remaining Discrepan- 
cies for more mformation about the relationship between the two 
studies. 
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popular approach in the discipline (e.g., Howell and 
Peterson 2002 and Wantchekon 2003). This is an 
important development for political science because 
field experiments are a promising method that over- 
comes many of the limitations of purely observational 
studies. 

However, Green and Gerber (2002, 810-11) go too 
far to conclude that with field experiments, “rudi- 
mentary data analysis replaces scores of regressions, 
freeing the researcher from the scientific and moral 
hazards of data mining.”? If field experiments work 
perfectly—with perfect random selection of a large 
sample and completely randomized assignment of 
treatment among individuals—and, in addition, the 
empirical relationships are unambiguously strong, then 
sophisticated statistical analysis may be unnecessary. 
However, precisely because field experiments take 
place in the real world, such perfection is almost never 
achieved in practice. 

The problems encountered by Gerber and Green 
(2000) highlight the difficulty of implementing exper- 
iments in the field. Statistical methods are essential 
for detecting and correcting such unintended, but not 
entirely unforeseeable, complications that arise in field 
experiments. The point of the article is not, however, 
to discourage field experiments as infeasible. ‘The les- 
son is that only with appropriate statistical methods 
can we draw valid inferences from field experiments. 
Such efforts are worth undertaking precisely because 
field experiments may give us a better understanding 
of causal processes. 


Implementation Errors and Remaining 
Discrepancles 


The statistical methods introduced in this article de- 
tected the implementation errors in the field exper- 
iment of Gerber and Green (2000). I sent the first 
draft of this article to Gerber and Green, pointing out 
what appeared to be their implementation errors. This 
prompted the authors to take another look at their 
original data. After they investigated potential imple- 
mentation errors, Gerber and Green graciously docu- 
mented what went wrong and posted a new data set on 
their Web site. On the same Web site, they published a 
retraction of some numerical results from their ASPR 
article and a replacement for the key table. Further 
questions from my analysis led to additional updates 
of the revised data. 

According to their latest account, Gerber and Green 
sent two separate lists of registered voters to the phone 
bank that they hired for telephone canvassing. The 
phone bank mixed up one of the lists with that for 


3 Using the study of the effects of campaign contributions on polit- 
ical access as an example, Green and Gerber (2002, 810-11) wnte, 
“Rather than launch a complex multivariate analysis of the flow 
to and from donations and access, the researcher may obtain an 
unbiased assessment of the average treatment effect merely by cross- 
tabulating access by the size of contribution Rudimentary data anal- 
ysis replaces scores of regressions, freeing the researcher from the 
scientific and moral hazards of data mining.” 
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another field experiment conducted by Gerber and 
Green (Public Opinion Quarterly, 2001) in West 
Haven. Among the mistakes that resulted, some vot- 
ers received an appeal message asking them to donate 
their blood rather than a message asking them to cast 
their ballots. Consequently, the experiment was not im- 
plemented in the way it was described in the original 
article. 

In this article, I present the methods used for de- 
tecting the errors of Gerber and Green’s experiment. 
I also apply the same methods to the most recent data 
and conclude that the failure of implementation is still 
apparent with the new coding scheme. Indeed, a sta- 
tistical test shows that the incomplete randomization 
observed in the revised data would occur only with a 
probability of about one in 2 billion. Given that the 
implementation errors exhibit systematic patterns, the 
treatment assignment of the revised data appears to be 
even less randomized than the original data. I hope that 
in their response to this article Gerber and Green will 
track down and report the source of the randomization 
problems in both data sets. 

Finally, I analyze the revised data with a more ap- 
propriate statistical method. Whichever data are used, 
the substantive conclusion remains the same: get- 
out-the-vote calls increase turnout, whereas Gerber 
and Green’s analysis implies otherwise. Nevertheless, 
Gerber and Green’s data correction brings their esti- 
mates closer to mine. This is not surprising because 
the implementation errors of field experiments can 
be fixed in two ways: by adjusting statistically as I 
demonstrate in this article or by correcting the data 
as Gerber and Green did. When possible, correction 
of data is generally preferable to ex post statistical ad- 
justments. Once the experiment has been conducted, 
however, data correction by the experimenter alone 
often fails to fix all of the errors that occurred during 
implementation. That is, there is no way to change the 
fact that randomization failed in Gerber and Green’s 
experiment. Therefore, further statistical adjustments 
are necessary for the revised data as well. 


ADVANTAGES OF RANDOMIZED FIELD 
EXPERIMENTS 


A central goal of scientific inquiry is to make causal 
inferences. In the physical sciences, experiments are 
essential for this purpose. In contrast, for many political 
scientists, analysis of observational data and compar- 
ative case studies have been the more common ap- 
proaches, and relatively few researchers conduct ex- 
periments. Recently, Green and Gerber (2002, 831) 
have characterized the state of the discipline as re- 
sembling “monocrop agriculture, efficiently generat- 
ing prodigious quantities of nonexperimental research 
but deeply vulnerable to an experimental intrusion 
that could consume the stock of received wisdom.” 
Indeed, the experimental approach can often provide 


* Gerber, Green, and Kaplan (2002, 1) conclude that “at some point, 
the only possibility of further learning comes from experimentation.” 


more insight into causal processes with fewer arbitrary 
assumptions than would be necessary in observational 
studies (e.g., Kinder and Palfrey 1993). 

Gerber and Green advocate field experiments, which 
are attempts of randomized interventions into real- 
world settings, as the best way to conduct empirical 
political science research. Although laboratory experi- 
ments offer greater control, conclusions based on such 
studies are often difficult to generalize. In contrast, 
field experiments combine real-world settings with a 
significant level of control over experimental design 
and produce more generalizable results. 


The Role of Randomization 


Fstablishing causality involves the comparison be- 
tween what actually occurred and what might have 
happened under different circumstances. The funda- 
mental problem of causal inference is that we never 
observe the counterfactual scenario (e.g., Holland 1986 
and King and Zeng 2003). For example, in order to 
measure the causal effect of British colonial rule on the 
postcolonial economic development of India, one must 
estimate the economic growth of India if it had not been 
ruled by the British empire. Answering such counter- 
factual questions is difficult, but doing so is necessary to 
address important research topics in political science. 

More formally, let ¥(7; = t) be the potential out- 
come under the treatment status, t, for unit i. Here, 
T, is an indicator variable that is equal to one if this 
unit received the treatment and zero otherwise. Then 
a treatment effect for unit 1, 7E,, can be defined as 


TE, = Y(T, = 1) - Y(T, = 0). (1) 


If a unit belongs to the treatment group, we only ob- 
serve Y,(7; = 1), and the counterfactual outcome if the 
same unit had not received the treatment, ¥,(7| = 0), 
is unknown. In the context of voter mobilization, if a 
voter received a get-out-the-vote call, we never know 
for certain whether he or she would have voted had the 
call not been received. Therefore, the validity of causal 
inference rests on how reliably we estimate the po- 
tential outcome under a counterfactual scenario. This 
is true even in experimental settings since we cannot 
repeat the identical experiment on the same unit in the 
same environment. 

One way to achieve this goal is to form an appro- 
priate control group that is similar to the treatment 
group in all characteristics except for the treatment 
status. In experiments, randomization plays a critical 
role in obtaining such a control group. By giving a 
treatment to randomly selected units, all characteris- 
tics of the treatment and control groups, except for 
whether they received the treatment, become similar 
as the sample size increases. As a whole, the two groups 
are essentially identical if there is a large sample. In 
this manner, randomization equalizes unobserved as 
well as observed characteristics of the two groups. If 
treatment is completely randomized, we can simply use 
the mean difference of the observed outcome between 
the treatment and the control groups as an unbiased 
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estimate of the average treatment effect. A serious 
limitation encountered in observational studies, in con- 
trast, is that researchers do not possess the powerful 
tool of randomization (e.g., Achen 1986). 


Quantitles of Interest In Fleld Experiments 


In many field experiments, the distinction between 
assignment of treatment and receipt of treatment is 
critical because researchers can often randomize the 
former, but not the latter. In the field, not everyone 
assigned the treatment by researchers actually receives 
it. In addition, some of those who are not assigned the 
treatment may receive one. Because of this noncom- 
pliance problem, the estimation of treatment effects 
in Equation (1) requires additional assumptions that 
allow for further statistical adjustments. 

The difficulty of estimating treatment effects leads 
many to estimate another causal quantity, known as 
the Intention-To-Treat (ITT) effect. Unlike the treat- 
ment effect, the ITT effect does not take into account 
whether those assigned the treatment received it. That 
is, the ITT effect represents the effect of treatment as- 
signment rather than treatment itself Unlike the treat- 
ment effect, the ITT effect is relatively easy to estimate 
so long as the treatment assignment is randomized. 
Indeed, for some cases when the information about 
who actually received the treatment is unavailable, one 
can only estimate ITT effects. Formally, let Z, be the 
treatment assignment indicator, which is equal to one 
if unit i is assigned the treatment and zero otherwise. 
Then, the ITT effect for unit i can be defined as 


ITT, = Y (L, Z = 1) = Y(T, Z, = 0), (2) 


where T, denotes whether the treatment was actually 
applied and is equal to either 0 or 1. 

Figure 1 shows the diagram of typical randomized 
field experiments. Here we assume that subjects would 
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never receive the treatment if they are not assigned 
one; ie., T, = 0 if Z, = 0. Because of the selection bias 
due to noncompliance described above, we cannot di- 
rectly compare those who received the treatment with 
the units of the control group. The ITT analysis is valid, 
on the other hand, as long as the treatment assignment 
group is compared with the control group. Further- 
more, in many cases we can only estimate the treatment 
effect for compliers (i.e., those who would receive the 
treatment only if they were assigned one), and to do 
so, we need to identify compliers in the control group. 

Once we identify such individuals, we can use them 
to estimate the average potential outcome under no 
treatment for compliers. 

ITT effects may substantially differ from treatment 
effects. Consider a hypothetical example where an in- 
ternational organization plans an AIDS prevention 
campaign in Africa and must choose from two pro- 
posals, The first proposal is to distribute educational 
pamphlets to local high schools. The second proposal 
is to put up an educational message on roadside bill- 
boards. The first policy would have the greatest treat- 
ment effect if those pamphlets are actually read by 
students at school. However, it is questionable whether 
school teachers will read them to students. It is also 
possible that the youth in schools are less likely to be 
infected with AIDS in the first place. Therefore, one 
would expect the ITT effects of this proposal to be low 
despite its potentially high treatment effect. In contrast, 
the billboard advertisements may have a higher ITT 
effect because they are more likely to be read by the 
target population. Thus, policy-makers may prefer the 
proposal to use billboard advertisement even if it has 
a smaller treatment effect. 

ITT effects are often useful for policy makers who 
are interested in the cost effectiveness of policy pro- 
grams. In contrast, academic researchers may care 
more about treatment effects in order to learn about 
underlying causal processes. For example, electoral 
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candidates may want to know about how many visits 
or postcards are necessary to increase voter turnout by 
one percentage point. In this case, it is not necessary to 
know how many voters actually talked to canvassers or 
read postcards. On the other hand, political scientists, 
who want to assess the relative effectiveness of vari- 
ous canvassing methods need this extra information. 
Even when personal canvassing seems less effective, 
for example, it may only appear ineffective because 
voters are more difficult to reach by visits than by 
postcards. Hence, the different compliance rates for 
the two methods become critical. 


THE NEW HAVEN VOTER MOBILIZATION 
STUDY 


In this section, I replicate and extend Gerber and 
Green’s analysis of the voter mobilization study. 
Gerber and Green (2000) designed and conducted an 
experiment where registered voters in randomly se- 
lected households of New Haven were encouraged to 
vote in the 1998 general election by means of per- 
sonal visits, phone calls, and postcards. They then ex- 
amined voting records and analyzed which strategies 
had increased voter turnout. In addition to the voting 
record of the 1998 election, the data include covari- 
ates that describe the following characteristics of each 
registered voter: number of registered voters in the 
household (one or two), age, party affiliation (regis- 
tered Democrats, registered Republicans, or others), 
voting record in the last general election (voted, did 
not vote, or was not registered for 1996 election), and 
ward of residence in New Haven (29 wards). 


Inefficient Experimental Design 


Table 1 shows the unusually complicated experimen- 
tal design of the original study with the substantial 
overlap of different treatment assignments. Over 40% 
of voters in the sample were assigned more than one 
treatment. For example, 122 voters were assigned to re- 
ceive three postcards, a phone call, and a personal visit 
with the civic duty message. Further variation in the 
nature of the treatment was possible because Gerber 
and Green used three different appeal messages; civic 
duty, neighborhood solidarity, and close election. The 
authors note that the neighborhood solidarity message 
was not used for phone calls (Gerber and Green 2000, 
656). Altogether, this design produced a total of 45 dif- 
ferent treatment combinations and their corresponding 
potential outcomes. 

Such complex experimental design leads to the inef- 
ficient estimation of treatment effects unless one makes 
arbitrary assumptions. This is unfortunate since the ad- 
vantage of experimental methods is to avoid additional 
assumptions that are often necessary in observational 
studies. For example, Gerber and Green (2000) assume 
that the effect of telephone canvassing remains the 
same regardless of whether voters have received other 
treatments. However, phone calls may not increase the 
probability of voting as much for those voters who al- 


TABLE 1. The Original Experimental Design 


Reported In Gerber and Green (2000) 


Mall 


Phone 
Visit 
Civic 
Nelghbor/ctvic4 
Close 


Nelghbor/ctvic* 
Close 


501 


No phone 
Visit 
Civic 227 
Neighbor 194 
206 


870 825 
Neighbor 10,800} 764 787 
Close 783 
Note: The figures represent the number of registered voters In 
New Haven for each treatment assignment combination. For 
example, 122 voters were assigned to receive three postcards, 
a phone call, and a personal visit with the crvic duty message. 
Treatment assignment groups of mterest are underlined. A box 
highlights the large contro! group. 
“For phone calls, the civic duty appeal was used instead of 


the neighborhood solidanty message (Gerber and Green 2000, 
656). 





ready have received a personal visit. Furthermore, the 
timing of contact differs from one canvassing method 
to another and this variation was not randomized; e.g., 
phone calls were made during the three days prior to 
the election, whereas personal visits were made over 
a period of four weeks. Such systematic differences 
in the administration of multiple treatments will yield 
incorrect inferences unless properly controlled in the 


analysis. 


Incorrectly Identified Treatment Assignment 
and Control Groups 


Gerber and Green (2000) also incorrectly identified 
the treatment assignment and control groups used in 
their field experiment and, as such, failed to estimate 
their causal quantities of interest. For example, when 
estimating the marginal effect of phone calls, Gerber 
and Green used the treatment assignment group that 
includes those who were also assigned other treatments 
such as personal visits and postcards (the upper two 
rows in Table 1). Their control group included those 
voters who were assigned other treatments (all cate- 
gories in the bottom two rows in Table 1). In order 
to correctly estimate the treatment and ITT effects, 
the appropriate control group should consist solely of 
the 10,800 voters who were assigned no treatment and 
hence received no intervention. Likewise, the mem- 
bers of the treatment assignment group for phone calls 
should not include those who were assigned any other 
treatment. 
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TABLE 2. Treatment Assignment and Contro! Groups Based on 


the Revised Data 


Phone 
Visit 
Crvic 
Clvic/blood* 
Crvic/blood-cemc? 
Neighbor 
Neighbor/clic® 
Neighbor/crvic-nelghbor? 
Close 
No visit 
Clvic 
Clvic/blood* 
Clic/blood-cmic? 
Nelghbor 
Nelghbor/ctvic® 
Nelghbor/civic-neighbor? 
Close 
No phone 
Visit 
Crvic 
Nelghbor 
Close 
. No visit 
Crvic 


Neighbor 10 
Close 


136 
175 
194 


815 
764 B49 
772 817 


I a — a — 
Note. The figures represent the number of registered voters In New Haven for each treatment 
assignment combination For example, 104 voters were assigned a phone call with the blood 
donation message and a personal visit with the civic duty appeal. Treatment assignment 
groups of Interest are underlined. A bax highlights the control group 

“For phone calls, the blood donation appeal was used Instead of the civic duty meesage. 
b For phone calls, erther the blood donation or the chic duty appeal was used. 

c For phone calls, the crc duty appeal was used instead of the neighborhood solidarity 


message 
d For phone calls, either the civic duty or the neighborhood solidartty appeal was used 


This implies that the ITT and treatment effects re- 
ported in Gerber and Green (2000) are confounded 
by the effects of other treatments.” In experiments, an 
appropriate control group is critical to ensure internal 
validity (e.g., Campbell and Stanley 1963). In princi- 
ple, it is advisable to minimize the number of treat- 
ments in field experiments. Although factorial designs 
may be feasible in laboratory experiments, additional 
complications such as noncompliance make it difficult 
to estimate the effects of multiple overlapping treat- 
ments in field experiments. In this article, I focus on 
the marginal effects of each treatment rather than their 
interaction effect, as the latter would involve additional 
assumptions and few data are available to estimate such 
quantities. 


5 This may lead to the underestimation of the treatment effect since 
the control group used by Gerber and Green includes those who 
received other treatments Many voters in the treatment assignment 
group were also assigned one or more of the other treatments. The 
treatment effects are likely to be small for those who have already 
received other treatments. 





Experlmental Design Based on the 
Revised Data 


As noted above, the analysis in the initial draft of 
this article detected the implementation errors and 
led to the subsequent revisions of the original data. 
Table 2 shows the treatment assignment and control 
groups based on the most recent data and Gerber and 
Green’s latest version of their experimental design. The 
total number of treatment combinations is now seventy, 
making the experimental design even more complex. 
For the analysis of the revised data, I correct the treat- 
ment group for telephone canvassing to include only 
those voters who were assigned no other treatment. 
I also exclude those who were possibly assigned the 
blood donation messages. This yields the total of 428 
voters with the civic duty appeal and 377 individuals 
with the close race message. The new control group 
consists of 10,582 voters who were assigned no treat- 
ment. 

The analysis of the revised data reveals discrepan- 
cies between Gerber and Green’s description of the 
implementation errors and the altered coding scheme. 
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TABLE 3. Estimated Average Intentlon-To-Treat (ITT) Effects on Voter 
Tumout Assuming Complete Randomization (Percentage Points) 


Original Data 


Gerber & Green 
(Incorrect Groups) 


3 times 


(Correct Groups) 


Revised Data 


Corrected ITT 
(Correct Groups) 


Note: The left column of estimates displays the results based on the incorrectly Identified groups 
as published in Gerber and Green (2000) The ITT estimates m the middle column use the proper 
treatment assignment and contro! groups, thereby correcting the original! analysis of Gerber and 
Green (2000). Finally, the estimates In the nght column are based on the revised data using the 
correct treatment assignment and control groupe. Standard errors are in parentheses. 

“The ITT effect of phone calls was not reported by Gerber and Green (2000) and 1s calculated 


based on their method. 


For example, on their Web site they describe one of 
their errors as follows: “Subjects who would have re- 
ceived Civic Duty mail or personal appeals received 
phone appeals requesting a Blood Donation” (see foot- 
note 4). Although this error should not affect the con- 
trol group of those who were assigned no treatment in 
the first place, the revised control group has about 300 
voters fewer than the original group. Such remaining 
inconsistency calls for further clarifications about the 
coding changes beyond what is currently documented. 


ANALYSIS ASSUMING COMPLETE 
RANDOMIZATION WITH CORRECTED 
TREATMENT ASSIGNMENT AND CONTROL 
GROUPS 


With the corrected treatment assignment and control 
groups, I reestimate the average ITT and treatment ef- 
fects by applying the statistical method used in Gerber 
and Green (2000), which assumes complete random- 
ization of treatment assignments. 


Estimation of the ITT Effect 


Under the assumption of complete randomization, the 
treatment assignment is independent of all observed 
and unobserved individual characteristics. Therefore, 
the difference in the sample means of the treatment 
assignment and control groups is an unbiased estimate 
of the average ITT effect. Namely, 


fre yet YZ Yet WL Z) 


N, N, (3) 





where N; = )~“, Z, is the size of the treatment assign- 
ment group No = X% (1 — Z,) is the size of the control 
group, and N = No + Ni. 

Table 3 shows the results of the ITT analysis us- 
ing the correct treatment and control groups. First, 
the corrected ITT analysis in the middle column con- 
firms the conclusion of Gerber and Green (2000) 
that personal canvassing is the most effective method 
for increasing voter turnout. Second, get-out-the-vote 
calls have a significant negative effect on turnout. Us- 
ing the appropriate treatment assignment and control 
groups does not change the odd finding of the orig- 
inal article that telephone canvassing reduces voter 
turnout. 

As one would expect, altering the data also changes 
the estimates. The analysis of the revised data with 
correct groups (in the right column) suggests that the 
overall ITT effect of phone calls is only slightly nega- 
tive, with a larger standard error. In the next section, 
however, I show that the data correction alone does 
not solve the entire problem. In principle, the imple- 
mentation errors of field experiments cannot be fixed 
by the experimenter after the fact without statistical 
adjustments. 

Mail canvassing also mobilizes voters. (Gerber and 
Green 2000, 661) argued that “even if the effective 
marginal costs of canvassing were doubled, face-to- 
face mobilization would still be cost effective.” This 
conclusion, however, is based on their assumption that 
all voters who were sent postcards actually received 
and read them (659, fn 10). Such an assumption is not 
warranted because many cards may not have reached 
a voter due to changes of address or may have been 


6 In the case of phone calls, for example, Nj = 958 and No = 10, 800. 
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discarded unread as junk mail. As a consequence, 
Gerber and Green (2000) underestimated the effec- 
tiveness of sending postcards by incorrectly compar- 
ing the estimated ITT effects for postcards with the 
estimated treatment effects for visits. The ITT effect 
may well be the most relevant for the evaluation of 
cost effectiveness in this case, but the comparison that 
was made here was inconsistent. The valid compar- 
ison of different canvassing methods must be made 
using the same estimand to evaluate their relative 
effectiveness. 

In contrast, the corrected ITT analysis in the mid- 
dle column of data in Table 3 makes the appropriate 
comparison of the ITT effects across the three mo- 
bilization strategies. Given the relatively low cost of 
sending postcards compared to visiting each voter’s 
residence, policy-makers might reasonably prefer to 
use postcard mailings as a cost-effective method to 
raise voter turnout. The corrected analysis also indi- 
cates that sending a postcard three times is much more 
effective than mailing it once or twice. This provides 
evidence against the assumption of Gerber and Green 
(2000) that the effect of mail canvassing is linear in the 
number of postcards sent. 


Instrumental Variables Estimation 
of Treatment Effect 


Moving from the estimation of ITT effects to that 
of treatment effects necessitates attention to compli- 
ance with treatment assignment. In field experiments, 
noncompliance often occurs because researchers can- 
not force everyone assigned a treatment to receive it. 
Table 4 shows that in Gerber and Green’s experiment, 
only one fourth of those assigned a treatment actually 
received it. The noncompliance exists mostly because 
voters were not at home (or were at home but unwill- 
ing to talk to a canvasser) when they were visited or 
telephoned. Furthermore, among 217 voters who were 
assigned a phone call and a visit, only 27 of them actu- 
ally received both treatments, illustrating the difficulty 
of estimating the effect of multiple treatments in field 
experiments. Note the significant difference in compli- 
ance rate for phone calls between the original and the 
revised data. This implies that the coding changes did 


TABLE 4. Low Compliance Rates In Gerber 
and Green’s Fleld Experiment 
Original Data 

Compliance 
Rate N 
253% 242 


Revised Data 


Compllance 
Rate N 

30.7% 247 

28.3% 740 
Phone & visit 12.4% 27 14.5% 16 
Note. The compllance rate represents the percentage of those 
who recetved treatments among those assigned them. N re- 
presents the number of voters who actually received treat- 
ments. For example, only about one fourth of voters answered 
the phone when called 


Phone 


Visit 28.1% 756 
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not occur randomly and that systematic changes have 
been made to the original data. 

Instrumental variables (IV) estimation is a well- 
known statistical method that identifies average treat- 
ment effects by focusing on those who would receive a 
treatment only if assigned (e.g., Angrist, Imbens, and 
Rubin 1996).’ An “instrument” is a variable that satis- 
fies an assumption referred to as the exclusion restric- 
tion; i.e., the instrument influences the outcome only 
through its effect on the treatment. In other words, the 
instrument cannot have any direct or indirect effect 
through variables other than the treatment variable. 
In field experiments, the assignment of treatment, if 
completely randomized, may serve as an ideal instru- 
ment. In Gerber and Green’s study, the fact that vot- 
ers were assigned telephone canvassing via random 
numbers generated by a computer is unlikely to affect 
anything other than the probability of their receiving 
phone calls. Formally, the exclusion restriction can be 
written Y¥(7;=t, Z,=1)=Y(T,=t, Z,=0) for t=0,1 
where Z, is the indicator variable for treatment assign- 
ment. 

The IV estimator is biased in small samples, but 
it consistently estimates average treatment effects for 
compliers in large samples when treatment assignment 
is completely randomized. Gerber and Green (2000) 
employ this approach to estimate the marginal treat- 
ment effects of telephone calls and personal visits for 
the subgroup of those who received an assigned treat- 
ment. The ITT effect divided by the compliance rate 
gives the IV estimate of complier average treatment 
effect. Namely, 


nee 


ITT 
SN ta (4) 
where the denominator represents the estimated com- 
pliance rate as appears in Table 4. 


Table 5 presents the IV estimates of the average 
treatment effects of telephone and personal canvass- 
ing.” Gerber and Green (2000) found that get-out- 
the-vote calls have a significant negative effect of 


7 Some argue that the treatment effect for the entire population 1s 
a more meaningful quantity (e.g, Balke and Pearl 1997) The inef- 
ficient design and high noncompliance rate of Gerber and Green’s 
experment make estimating such a quantity difficult. I computed the 
nonparametric bounds of the average treatment effect for 

visits and phone calls and found that they are [—27.9%, 43 9%] 
and [—28.1%, 46.6%], which implies that the data from this field 
experiment are almost entirely uninformative about this quantity of 
interest 

8 To be precise, this is not always the case. In Gerber and Green 
2000, for example, the existence of potential spillover effects within 
households will violate this assumption even if the assignment is 
completely randomized. However, since Gerber and Green’s repli- 
cation data do not contain the information about which household 
each voter belongs to, it 1s impoesible to conduct the household-level 


? Gerber and Green used the two-stage least squares and the two- 
stage probit regression for phone calls since the phone treatment 
assignment was correlated with the postcard assignment. Both are 
variants of IV estimation presented here See, eg, Angrist and 
Imbens 1995 for a complete discussion. 
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TABLE 5. 
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Instrumental Varlables (IV) Estimates of Average Treatment 


Effects on Voter Turnout (Percentage Points) 


Onginal Data 


Gerber & Green 
(Incorrect Groups) 


Overall effect 
Single-voter households 
Two-voter households 
Clic duty 
Nelghborhood solidarity 


Revised Data 


Corrected IV 
(Correct Groups) 


Visit 
13.9 
(3.8) 


Mote. The table shows that the negatrve finding for telephone canvassing ts driven by the large and 
negative effects for single-voter households. The teft two columns of estimates display the results based 
on the Incorrectly kdentrhed groups used by Gerber and Green (2000). The IV estimates in the middle two 
columns use the proper treatment assignment and control groups, thereby correcting the original anatysis 
of Gerber and Green (2000). Finally, the estimates In the nght columns are based on the revised data 
using the correct treatment assignment and control groups. Standard errors are m parentheses. 

“Since Gerber and Green (2000) did not report the separate analysis of phone calls for different household 
types and appeal messages, those estimates In the table are calculated based on thelr method. 


five percentage points on turnout.!? Moreover, their 
inappropriate use of overlapping treatments obscured 
greater problems. Correcting the treatment assignment 
and control groups makes the effect even larger, reach- 
ing —12 percentage points with a standard error of 
seven percentage points. These IV estimates based on 
the original data suggest that get-out-the-vote calls en- 
couraging people to vote discourage them from casting 
their ballots. 

Note that although the negative effect for single- 
voter households seems to persist in the revised data, 
the estimated overall effect of phone calls is now small 
with a large standard error. This is similar to the situa- 
tion of ITT estimates mentioned above in that the data 
correction brings Gerber and Green’s estimates closer 
to positive effects. As I show below, however, data cor- 
rection alone is not sufficient to fix the implementation 
eITOrs. 

Finally, the corrected IV estimates for personal visits 
are much greater than those from the original analysis 
for both original and revised data, reaching to an in- 
crease of more than 10 percentage points in turnout. 
This significant difference is solely due to the correc- 
tion of treatment assignment and control groups. This 
is clear evidence against the assumption of Gerber and 
Green (2000, 660) that the effects of different canvass- 
ing methods are constant and additive. 


10 While Gerber and Green’s two-stage least-squares analymis (with 
all covariates) indicates a smaller negative effect, their two-stage 
probit analysis shows that the effect of phone calls is about negative 


five percentage points and statistically significant. 





METHODS FOR EVALUATING THE 
IMPLEMENTATION OF FIELD EXPERIMENTS 


While the IV method is useful in many situations, 
the validity of its use relies on the key assumption 
that treatment assignment is completely randomized. 
Below, I show that this assumption was violated in 
Gerber and Green’s experiment and that the violation 
led to their negative finding about telephone canvass- 
ing. Indeed, I now demonstrate, with statistical tests 
I introduce, that the pattern of incomplete random- 
ization observed in Gerber and Green’s original data 
would occur with a probability of less than one in 
300 million. These results led to the discovery of the 
implementation errors of their experiment. 

The fact that the errors did not occur randomly is an- 
other indication of failed randomization in this experi- 
ment. For example, Gerber and Green’s revisions of the 
original data increased the overall rate of compliance 
for phone calls by five percentage points (see Table 4). 
This difference is statistically significant (p-value, 0.01), 
implying that the implementation errors systematically 
affected those individuals who were more likely to an- 
swer the phone when called. Thus, IVY estimation, which 
assumes complete randomization, is not an appropriate 
method to analyze either the revised or the original 
data. 

Detecting the implementation errors of field experi- 
ments is generally a difficult task. The main challenge 
arises from the fact that statistical tests based on the 
observed data cannot guarantee that the treatment as- 
signment is randomized with respect to unobserved 
variables. For this reason, it is advisable to gather as 
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many important covariates as possible when design- 
ing field experiments. The validity and efficiency of 
resulting estimates can be improved by incorporating 
covariates in the randomization procedure (e.g., using 
stratification methods) as well as in the data analysis. 


Assessing Sensitivity of IV Estimates 


Examination of different subgroups is one way to check 
the robustness of IV estimates. Such analysis can be in- 
formative since the IV estimate of the overall treatment 
effect is equal to the weighted average of estimates 
for different subgroups. The analysis shows that the 
large negative effect among single-voter households 
underlies Gerber and Green’s pessimistic conclusion 
about the effect of telephone canvassing. In particular, 
the overall estimate of negative five percentage points 
reported in Gerber and Green (2000) is largely due to 
the significant negative effect of 14 percentage points 
found for single-voter households, with a standard 
error of four percentage points.’ Similarly, the cor- 
rected IV estimate for this subgroup is —27 percentage 
points (with a standard error of 10 percentage points), 
which leads to an overall effect of —12 percentage 
points. Large negative effects for single-voter house- 
holds contrast with positive effects for two-person 
households. For the revised data, the gap between the 
two subgroups is also apparent; i.e., —13 percentage 
points for single-voter households and positive seven 
percentage points for two-voter households (with stan- 
dard errors of nine and eight percentage points, respec- 
tively). 

Looking closely at subgroups that received different 
messages also reveals large negative IV estimates for 
the effect of get-out-the-vote calls. The corrected IV 
analysis for the original data shows that both civic duty 
and close race messages significantly reduce turnout, 
by more than 10 percentage points.” Although the 
corresponding standard errors are larger, the analy- 
sis of the revised data reveals even larger differences 
among the appeal messages; the close race message 
reduces turnout by 12 percentage points, whereas the 
civic duty appeal increases turnout by five percentage 
points. The inconsistency of the estimates across data 
sets as well as subgroups raises questions about the va- 
lidity of conclusions regarding the effect of telephone 
canvassing. 


Detecting Incomplete Randomization 


Although it is generally difficult to check the random- 
ization with respect to unobserved variables, the ex- 
perimental design of Gerber and Green (2000) allows 
for such a test. In particular, both personal visits and 
phone calls are supposed to be assigned with randomly 
selected appeal messages: civic duty, neighborhood sol- 


11 Gerber and Green (2000, 658) report the results of the separate 
subgroup analysis for personal canvassing but not for phone calla 
12 Applying Gerber and Green’s incorrect groups, J also find that the 
civic duty appeal has a significant negative effect of eight percentage 
points, while the effect of the close race message is only slightly 
negative. 
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idarity (not used for phone calls), and close election. 
If the assignment of appeal messages is random, one 
should see no systematic difference in compliance rates 
among different messages. This is because the ran- 
domization would prevent one message from being 
assigned to a group of people who are more likely to 
receive the treatment. Since the probability of voters 
being at home and willing to talk to a canvasser when 
called or visited depends on their unobserved charac- 
teristics as well as their observed ones, this test allows 
us to check the balance of unobserved voter character- 
istics. (“Balance” refers to a similar distribution for a 
variable in two subgroups. ) 

This analysis reveals that for phone calls, those who 
were assigned the close race message are on average 
about 10 percentage points more likely to answer a call 
than those who were assigned the civic duty appeal 
(p-value, 0.00073). For personal visits, one finds no 
systematic variation in compliance rates among differ- 
ent appeal messages; Pearson’s x? test shows that one 
cannot reject the null hypothesis of equal compliance 
rate for all three appeal messages (p-value, 0.71)."* 
The different compliance rates for phone calls indi- 
cate the kind of nonrandom treatment assignment that 
could also explain the negative effects observed in 
Gerber and Green’s IV analysis. 

Even when it is impossible to check the randomiza- 
tion with respect to unobserved variables, one can con- 
duct tests for observed variables. I apply the following 
method, which can be used to jointly test whether all 
observed covariates are balanced. First, I use a logistic 
regression to predict the assignment of each treatment 
using all covariates and their first order interactions 
as covariates. If the model predicts treatment assign- 
ment well, this represents evidence that treatment as- 
signment was not randomized. If treatment assignment 
is completely random, then assignment should be in- 
dependent of all covariates and any function of those 
covariates.'® Finally, I conduct the residual deviance 
test to examine whether these covariates together 
significantly help predict the treatment assignment 
(McCullagh and Nelder 1989, 119). 

Table 6 presents the p-values of this test using the 
x distribution. The p-values are very small, indicating 
the failure of randomization for all three treatments in 
both original and revised data. For example, the test 
for postcard mailings implies that the departure from 


D The test assumes that the content of messages does not affect 
compliance. This assumption may be justified because all messages 
have the identical openmg script. Also, the scripts are relatively short; 
telephone scripts lasted only for about 30 seconds (Gerber and Green 
2000, 656). 

14 The result holds even when looking at the incorrect treatment 
assignment and control groups used in the original analysis. The 
mean difference for telephone canvassing is five percentage points 
(menificant at the 0 01 level), while for personal canvassing differ- 
ences across messages are not significant. 

15 Due to the small suze of its treatment group, for phone calls, only 
the past voting record and the household type variables are inter 
acted with the other covariates. 

16 Tf there are enough observations, other functional forms can be in- 
cluded in the model 1n order to allow for a more complex correlation 
structure. 
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Original Data 


Mall 0.0000000035 


tells us that under the assumption of successful 


sample sizee. 


randomization observed in Gerber and Green’s data 
can occur only with a probability of one in 300 million. 
This probability is smaller for the revised data, reaching 
to one in 2 billion. (Note that a small sample size makes 
it harder to detect failure of randomization, so that 
the larger p-value for phone calls than for visits and 
mailings does not necessarily imply that randomization 
was more successful.) In sum, the test with respect to 
observed covariates also provides strong evidence that 
treatment assignment was not randomized in Gerber 
and Green’s field experiment. 

In field experiments, randomization of treatment as- 
signment is not as easy to accomplish as one might 
expect. In practice, it is often difficult to randomize 
every aspect of each treatment. In Gerber and Green’s 
experiment, personal canvassing was conducted over a 
period of four weeks before the election, whereas tele- 
phone canvassing took place over three days including 
the election day. Postcards were sent out during the 
two weeks before the election. Although a visit right 
before the election would have a greater effect than a 
visit one month before the election day, the timing of 
contact was not randomized. Likewise, the effect of dif- 
ferent canvassers, if not randomized, can confound the 
effect of different canvassing methods. These examples 
illustrate the difficulty of randomization and potential 
confounding effects that threaten the validity of field 
experiments. 

Finally, I investigate the sources of the negative 
finding about phone calls. Both Gerber and Green’s 
analysis and the corrected IV analysis indicate that 
telephone canvassing has a large and negative effect 
on voter turnout among single-voter households. I find 
that for this subgroup the assignment of phone calls 
was not randomized with respect to the past voting 
record. In particular, only 42% of the treatment as- 
signment group voted in the last election, whereas 
47% of the control group voted (p-value, 0.05). The 
randomization for this group appears to be incom- 
plete even with the incorrectly identified treatment 
assignment and control groups used by Gerber and 
Green.!” Since those who voted in the last election are 


17 Compared with the control group, the treatment assignment group 
includes significantly more indrviduals who abstained in the last elec- 
tion. The mean difference 1s statistically significant at the 0.05 level. 


TABLE 6. Probabllity of Successful Randomization with Respect to 
Observed Covariates In Gerber and Green’s Fleld Experiment 


7,389 

Do a ee 

Note: Probabiltty represents the p-value of the residual deviance test from a logistic regression model pre- 

dicting the assignment of each treatment grven all obeerved covariates and tharr first-order Interactions 

N represents the size of the treatment assignment group The last row in the second column, for example, 
randomization 


for mailings observed In Gerber and Green's orignal data would occur only with a probability of about one 
in 300 million. These probabilities cannot be compared across different treatments because of different 
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Revised Data 
















2,615 
7,190 


, the pattern of incomplete randomization 


40 percentage points more likely to vote in the current 
election on average, this difference contributes to the 
large negative effects of phone calls for single-voter 
households. 


When One Should Not Use IV Estimation 


The large bias of IV estimation that results from vio- 
lation of the exclusion restriction is well documented 
(e.g., Angrist, Imbens, and Rubin 1996, 450). In partic- 
ular, the bias is worsened when unbalanced variables 
are good predictors of the outcome variable and when 
a large number of noncompliers exist. Equation (4) 
illustrates these two conditions; the bias of the IV es- 
timate is large (a) when the bias of the ITT estimate 
due to incomplete randomization is large and (b) when 
the compliance rate is low. (Recall that the IV estimate 
is equal to the ITT estimate divided by the estimated 
compliance rate.) 

Gerber and Green’s study fits both conditions for 
large bias. First, the unbalanced covariates (i.e., the 
voting record in the previous election) predict turnout 
well, which suggests that the bias in the estimated ITT 
effect is large. Furthermore, the compliance rate of this 
field experiment is low (about 25% for phone calls). 
This low compliance rate implies that if the ITT effect is 
biased by five percentage points, for example, then the 
bias of the IV estimate can be as large as 20 percentage 
points. Thus, the combination of a large bias in the ITT 
estimate and low compliance rate led to the puzzling 
finding that get-out-the-vote calls significantly decrease 
turnout.!8 

If one successfully randomizes the treatment as- 
signment, the method of instrumental variables can 
give estimated treatment effects that are consistent in 
large samples. However, as the analysis of this section 
suggests, making this assumption in practice requires 
careful experimental design and successful implemen- 
tation. In this case, the failure of randomization for tele- 
phone canvassing led to inaccurate causal inferences 


18 Tt 1s also important to note the finite sample bias and inefflaency 
of IV estimation (e.g, Bound, Jaeger, and Baker 1995). The small 
size of each treatment group in the New Haven mobilization study 
suggests the importance of finite sample consideration. 
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TABLE 7. Dtfferences In Observed Characteristics between Compliers and Control 


Group Prior to Matching Adjustment 


Phone Call 
t Stat. Var. Ratlo 


Varlable Mean DIF. 


9.01 7.00 
18.8% 6.41 
—8.9% —4.32 
5.5% 1.95 
0.6% 0.40 
2.6% 0.79 


Voted In '96 election 
Newly registered voter 
Registered Democrat 
Registered Republican 
Two-voter household 


Mean Diff. 
3.22 
3.9% 
—0.5% 

3.0% 
—1.2% 
—0 3% 


Nota: The table shows the differences in covanate distributions due to noncompliance. The mean of each covariate for 


the contro! group is subtracted from that for the treatment group. The t statistics for these mean differences are also 
. The vanance ratios are calculated by dividing the vanance of the treatment group by that for the contro! group. 
Matching would be unnecessary If mean differences were near zero and the vanance ratios were near one. 


about the effects of get-out-the-vote calls in Gerber 
and Green (2000). 


ANALYSIS WITHOUT ASSUMING 
COMPLETE RANDOMIZATION 


The previous section showed that IV estimation was 
inappropriate for telephone canvassing given the in- 
complete randomization of treatment assignment. This 
calls for more general statistical methods to estimate 
the effects of nonrandom treatments. I apply propen- 
sity score matching to reduce the bias caused by non- 
random treatment.” Matching is particularly useful for 
field experiments when randomization of treatment as- 
signment is incomplete and important covariates are 
available. The basic idea of matching follows the logic 
of causal inference described earlier. The goal is to 
construct a control group as similar to the treatment 
group as possible. The method of matching finds two 
groups of subjects who have exactly the same observed 
characteristics except that one receives the treatment 
and the other does not. Since matching is a nonpara- 
metric method, it does not require the assumptions of 
usual regression analysis, (e.g., linearity and additivity), 
and hence it effectively reduces bias due to incomplete 
randomization. 

The intuition behind matching resembles the tra- 
ditional comparative case study method, which dates 
back to John Stuart Mill (1930/1843). Both approaches 
call for comparing cases that are very similar to each 
other except for the primary causal variable. This fa- 
cilitates the evaluation of main causal effects in iso- 
lation by reducing the possibility of confounding ef- 
fects from other variables, Although the comparative 
method has largely been used for qualitative studies, 
with the method of matching, quantitative and histori- 


19 The estimand for the method of matching (Le~, the average treat- 
ment effect for the treated) can differ from that for IV estimation (Le., 
the average treatment effect for compliers). In the New Haven mo- 
bilization study, however, the two estumands are equivalent because 
the treated did not include “always-takers,” who take the treatment 
even when they are not assigned the treatment (L.e. it is assumed 
that T; = 0 of Z = 0). See Angrist, Imbens, and Rubin 1996 for a 
complete discussion of this issue. 
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cal case studies can rest on a common ground of causal 
inference. 


Selection Blas Due to Noncompllance 


In field experiments, even when treatment assignment 
is completely randomized, the actual treatment group 
of compliers (7; = Z, = 1), as opposed to the treat- 
ment assignment proup (Z, = 1), is often different from 
the control group (7; = Z, = 0) in its characteristics. 
Table 7 illustrates the imbalance of observed covari- 
ates between compliers and the control group. The 
wide gap between the two groups indicates a signifi- 
cant selection bias that requires statistical adjustment. 
Compliers are older, are more Democratic, and have 
a better past voting record than the control group. A 
similar pattern is observed in the revised data. Esti- 
mates of treatment effects will be biased, unless one 
properly adjusts for these systematic differences be- 
tween the two groups. Next, I explain how propen- 
sity score matching effectively reduces this selection 
bias. ' 


Matching 


The key assumption of matching is that compliers in 
the control group can be identified using their observed 
characteristics. In other words, the assumption implies 
that it is possible to estimate the counterfactual out- 
come under no treatment for a treated unit by using 
individuals from the control group who share the same 
observed characteristics. Formally, the counterfactual 
outcome under no treatment, Y{T = 0), is assumed to 
be mean independent of the actual treatment status, T, 
conditioning on the set of observed control variables, 
X (e.g., Heckman et al. 1998), 


EXT = 0)|T =1,X} = E{¥(T =0)| T=0, X}. 
(5) 


Equation (5) implies that matching effectively re- 
duces bias when important covariates are observed. 
Omitted variable bias is possible if the observed covari- 
ates, X in Equation (5), do not contain variables that 
affect both T and Y(T = 0). The bias can be reduced, 
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however, if those omitted variables are highly corre- 
lated with X. An advantage of matching is that this 
conditional independence assumption does not require 
parametric functional forms common to usual regres- 
sion analysis such as linearity and additivity (see Ho, 
Imai, King, and Stuart 2004). If the assumption of 
Equation (5) is met, matching gives an unbiased es- 
timate of average treatment effect by integrating over 
the distribution of xX, 


E(YT =1)- ¥{T=0)|T=1) 
= Ex[E(¥T = 1)|T = 1, X} — E{Y(T=0)|T = 0, X]. 
(6) 


Propensity Score 


Unfortunately, the application of exact matching be- 
comes practically impossible as the number of control 
variables increases. For example, one might be able 
to match on voting records but not on age, ward of 
residence, etc. This curse of dimensionality implies that 
when many control variables are present, the standard 
regression analysis with its restrictive parametric as- 
sumptions often fails to pinpoint the correct functional 
relationship among the outcome and treatment vari- 
ables. Even with the nonparametric method of match- 
ing, it is often difficult to find control units whose 
characteristics match exactly with a treated unit for all 
dimensions. 

The use of the propensity score, defined as the con- 
ditional probability of receiving a treatment, aids the 
method of matching in such multivariate settings. It can 
be shown that this single variable summarizes relevant 
information in all observed control variables. Then, 
one only needs to match on this scalar variable, which 
is much more feasible than matching on the entire 
vector of X. More formally, Rosenbaum and Rubin 
(1983) show that conditioning on the propensity score, 
e(X) = Pr(T = 1X), is equivalent to conditioning on 
all observed characteristics, X. Hence, without addi- 
tional assumptions, Equation (5) can now be replaced 
with 


EIT =0) | T=1, e(X)} = E(Y(T=0) | T=0, e(X)}. 
(7) 


In most cases, however, the true propensity score 
is unknown to researchers. Thus, one must estimate 
it by modeling the actual receipt of treatment given 
observed covariates. The logistic regression can serve 
this purpose, although semiparametric and nonpara- 
metric methods can also be employed. Whatever model 
is used, the estimated model itself carries little causal 
interpretation and should be regarded as a tool to cre- 
ate a matched control group similar to the treatment 
group. If the propensity score is estimated properly, 
the distribution of observed covariates should be simi- 
lar between compliers and matched control units. One 
has to change the model specification and reestimate 
the propensity score until this balance is achieved. An 
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important advantage of propensity score methods over 
usual regression analysis is this diagnostic test that di- 
rectly assesses the validity of causal inferences. 

Although it is known to effectively reduce bias 
caused by nonrandom treatment (e.g., Dehejia and 
Wahba 1999), propensity score matching, like any other 
statistical method, risks bias due to omitted variables. 
That is, the method can only balance observed char- 
acteristics of the treatment and control groups. For 
example, if “politically interested” voters whose char- 
acteristics are not captured by observed variables are 
more likely to talk to a canvasser and go to polls, then 
propensity score matching may yield biased estimates. 
Estimates based on propensity score matching are also 
biased when the treatment group is too different from 
the control group. 

In Gerber and Green’s study, however, these prob- 
lems are unlikely for three reasons. First, since the 
large control group roughly represents the population, 
we know that compliers exist in this group. Second, 
as shown later, I find many exact and close matches, 
indicating that the bias due to inexact matches is min- 
imal. Third, when the covariates measuring impor- 
tant characteristics of subjects are available, propensity 
score matching is a powerful method for reducing bias. 
The availability of the voting record of the previous 
election is critical for successful matching in Gerber 
and Green’s data. The ability to match on this vari- 
able allows further bias reduction by balancing unob- 
served variables that are correlated with the past voting 
record. 

Although propensity score matching only uses a sub- 
set of the control group, the comparison of treated 
units with a matched control group gives more reliable 
estimates of treatment effects. When treatment assign- 
ment is not completely random and important covari- 
ates are observed, propensity score matching is the 
best available statistical method. Certainly, it is more 
appropriate than the method of instrumental variables 
used by Gerber and Green. Under these conditions, the 
method can also be applied to observational studies. 
Imai and van Dyk (2004) extend the propensity score 
to nonbinary treatments that are often encountered in 
observational studies. This generalization widens the 
potential applications of propensity score beyond ex- 
perimental settings. 


Application of Propensity Score Matching 
and Diagnostics 


I apply the procedure referred to as nearest-neighbor 
propensity score matching to the New Haven voter mo- 
bilization study (Rosenbaum and Rubin 1985a, b). The 
goal is to select a group of voters of the 10,800 voters in 
the control group such that the distribution of covari- 
ates for the matched control group is similar to that for 
the treatment group. For each treatment unit, I select 
a control unit whose propensity score is the closest.” 


2 I randomly order the treatment and control units before conduct- 
ing matching. 
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TABLE 8. Similarity of Observed Covariates between Treatment Compllers 
and Matched Control Groups 


Phone Call 


Variable Mean DI. 
0.23 


—0.8% 


t Stat 


Age 

Voted In ‘86 election 
New registered voter —1.0% 
Reglstered Democrat 1.4% 
Reglstered Republican —0.2% 
Two-voter household 1.9% 
Ward of resktence 


Exact match 


0.54 


Var. Ratio 


25.5% matched 
19.3% matched 


Personal Visit 


Mean Diff tStat. Var. Ratio 
0.21 1.00 
—0.06 1.00 
—0.16 0.99 
—0.61 1.03 
0.36 1.07 
0.08 1.00 
35.4% matched 


25.7% matched 


Note: The table shows that matching effectrvety balances the observed covanates The mean of each 
covanate for the control group ts subtracted from that for the treatment group. The t statistics for these 
mean differences are also reported The variance rates are calculated by dividing the vanance of the 


treatment group by that for the control group 


with Table 7, the mean differences are closer to 


zero and the varlance ratios are closer to one, indicating that the covariate balance of the two groups is 


significantly umproved by matching 


If there is more than one voter with the same propen- 
sity score, I randomly select one of them. I repeat this 
procedure to obtain several matched control units for 
each treated unit; five matches for phone calls, yield- 
ing 1,210 selected control units, and three matches for 
personal visits and mailings (three postcards), yielding 
2,268 and 7,125 matched control units, respectively. In- 
creasing the number of matched control units generally 
improves the efficiency of resulting estimates because 
more observations are included in the analysis, but it 
will typically produce a greater imbalance of covariates 
between treated and matched control units, which in 
turn may lead to biased estimates. As shown below, 
different matching schemes can also be used for sensi- 
tivity analysis to detect this potential bias. 

To estimate the propensity score, I use logistic re- 
gression starting with the specification where I include 
all available covariates as linear predictors. When this 
model does not balance all covariates, I search for an 
alternative specification by including additional terms 
to improve the balance.” I use mean differences and 
variance ratios to investigate the resulting balance of 
covariates and determine model specification. Since all 
covariates except age of voters are indicator variables, 
these two statistics are generally sufficient to measure 
the similarity of the covariate distributions between 
treated and matched control units. The availability of 
such diagnostic tests for model specification is an im- 
portant advantage of propensity score methods. 


21 The model specifications for the orginal data are as follows. For 
phone calls, the household type variable is interacted with past voting 
record. For personal visits, the household type is interacted with the 
other variables except the new voter variable Both models include 
the square term of age. For mailings, the household type is interacted 
with age, past voting record, and ward of residence vanables. The 
model specifications for the revised data are as follows. For phone 
calls, the square term of age and the two interaction terms of the 
household type, one with the past voting record and the other with 
the new voter variable, are added For personal visits, the interaction 
terms of the household type with the other variables except the 
past voting record are added For mailings, the household type was 
matched first, and all first-order interaction terms are included. 
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Table 8 shows that matching on the estimated 
propensity score successfully balances all observed 
covariates. The mean differences of all covariates 
between the treated units and the control-group 
individuals are not statistically significant and their 
variances are similar. In particular, propensity score 
matching significantly improves the balance of covari- 
ates compared with Table 7. I also find many exact 
matches. For phone calls, about one fifth of the matched 
control units share exactly the same values of all co- 
variates with one of the treated units. That is, they live 
in a household with the same number of registered 
voters, are exactly the same age, have the same party 
affiliation, reside in the same ward of New Haven, and 
have the same voting record in the previous election. 
Similarly, in the case of personal visits, I find about one 
fourth of the matched control units to be exact matches. 

Figure 2 further compares the similarity of the two 
groups by examining the distributions of the estimated 
propensity score. Since the propensity score is a scalar 

of all observed covariates, successful match- 
ing should produce a matched control group whose 
propensity score distribution is similar to that of the 
treatment group. While the distributions of the treat- 
ment group (indicated by the gray density) and control- 
group individuals (indicated by the solid line) are sub- 
stantially different before matching, they are almost 
identical after matching. 

Finally, the same test as shown in Table 6 can be 
applied to the matched sample. I use the same logistic 
regression to predict the receipt of each treatment in 
the sample that combines those who received the treat- 
ment with a group of compliers selected by matching. 
If matching is successful, the model should not pre- 
dict the receipt of any particular treatment well. The 
results show that after matching, the model no longer 
predicts the receipt of treatments. Indeed, using the 
original data, the p-values for phone calls, personal 
visits, and postcard mailings are 0.63, 0.67, and 0.65, 
respectively. For the revised data, the results are 0.84, 
0.88, and 0.99. The large p-values contrast with the 
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FIGURE 2. Distributions of Propensity Scores for Treatment and Control Group Before and After 
Matching Adjustment 


Phone calls 


0.10 


0.10 


Mote’ The graphs are smooth versions of histograms produced with Gausslan kemels. Gray areas and solld linee represent the 


distributions of 


scores for treatment and control groups, respectively. Before matchmg adjustment, the two distributions are 


quite c&fferent (left). After matching, however, they are almost identical (nght) 


results in Table 6, confirming that the matched sample 
balances the covariates between the treatment and the 
control groups. 

The effectiveness of matching illustrates an impor- 
tant advantage of randomized field experiments. In 
many observational studies, it is often difficult to con- 
duct matching adjustment because the treatment group 
is too different from the control group. For such cases, 
even the propensity score may prove inadequate. In 
field experiments, such problems are less likely because 
the control group tends to be a representative sam- 
ple of the relevant population. Despite the random- 
ization problems for phone calls, Gerber and Green’s 
study produced treatment assignment and large con- 
trol groups for which propensity score matching can 
effectively balance all covariates. 


GET-OUT-THE-VOTE CALLS INCREASE 
TURNOUT 

After matching with the estimated propensity score, I 
calculate the average treatment effects of phone calls 








and personal canvassing as well as the average ITT ef- 
fects of mailings (three postcards). Table 9 presents the 
matching estimates for revised data. The results based 


TABLE 9. Matching Estimates of Average ITT 
and Treatment Effects on Voter Turnout 
(Percentage Polnts) 
Phone Visit Mall 
Overall effect 6.5 9.2 1.5 
(3.2) (2.1) (1.0) 
Single-voter households 6.9 9.6 0.7 


(4.8) (3.1) (1.7) 

Two-voter households 6.1 8.9 2.2 

4. 2.9 1.2 

average treatment effects are estimated for personal 

visits and phone calls, while the average ITT effects are est- 

mated for mail canvassing (three postcards). The results are 
replications Standard 


Note: The 


errors are in 


based on 500 bootstrap 
parentheses. 
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FIGURE 3. Comparison of Matching Estimates and Gerber and Green’s Results for Average 


Treatment Effect of Get-Out-the-Vote Calls 
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Estimated Treatment Effect of Get-Out-The—Vote Calis 


Note: The estimated average treatment effect of phone calls. The Normal distribution is used to approximate the distribution of estimates. 
While the matching estmates Indicate that phone calls have a positive Impact on tumout, Gerber and Green's results (APSA 2000, 


solid line, Web site 2002, dashed Ine) Impty otherwise 


on the original data are similar.” The results show that 
get-out-the-vote calls increase turnout by a little more 
than six percentage points on average (with a standard 
error of 3 percentage points), reversing the negative 
finding reported in Gerber and Green (2000). While it 
may not appear as effective as personal visits, telephone 
canvassing offers a significant alternative mobilization 
strategy. The matching estimate for personal visits is 
significantly smaller than the corrected IV estimate. 
The estimated ITT effect of sending three postcards 
is about two percentage points. Mailing appears to be 
especially effective for two person households, suggest- 
ing that along with phone calls, mail canvassing may 
represent another cost-effective mobilization strategy. 
Although the overall effect of postcards may appear 
to be smaller than that of phone calls and visits, such a 
simple comparison is misleading. While the ITT effect 
of postcards is estimated for the entire treatment as- 
signment group, the treatment effects of the subgroup 
of compliers are estimated for the other canvassing 
methods. In particular, it is possible that postcards may 
be as effective for compliers as phone calls and visits 
are for this subgroup. Unless we have the information 
about who actually read postcards, it is difficult to iden- 
tify the treatment effect of postcards for compliers. 
Figure 3 compares the matching estimates with the 
original results reported in Gerber and Green (2000) as 
well as the estimates posted on their Web site (see foot- 
note 4). (When analyzing the revised data, Gerber and 
Green incorrectly identify their treatment and control 
groups. Thus, their estimates differ from the corrected 
IV estimates reported in Table 5, which are based on 
the actual treatment assignment and control groups.) 
The conclusions one would draw from two statistical 
methods are clearly different. Matching shows that get- 


2 The results for the original data are as follows 7.1% (32) for 
phone calls, 85% (2.1) for vists, and 2.2% (1.1) for postcards, where 
standard errors are in parentheses 
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out-the-vote calls increase turnout, whereas Gerber 
and Green’s [V analysis indicates that such calls may 
discourage voters from casting their ballots. Although 
Gerber and Green’s Web site results are somewhat 
closer to my matching estimates, the difference shows 
that the data correction alone is not sufficient to fix all 
the problems that have occurred when implementing 
their field experiment. 

The positive finding about telephone canvassing 
agrees with the results of another experimental study 
recently conducted in a different setting by the same 
authors as well as the earlier experimental results (e.g., 
Adams and Smith 1980, Eldersveld 1956, and Miller, 
Bositis, and Baer 1981). In their recent study, Green 
and Gerber (2001, 2) conclude that “phone canvassing 
increased turnout by an average of 5 percentage-points. 
This finding, based on six experiments involving nearly 
10,000 people, is statistically significant.”” Given that 
making a phone call costs much less than visiting a 
home, get-out-the-vote calls may be the most cost- 
effective mobilization strategy. 


Sensitivity Analysis 


I conduct two kinds of sensitivity analysis. First, I in- 
vestigate how the matching estimates differ between 
the two types of households. The instability of IV esti- 
mates for phone calls in the original data was apparent 
from the discrepancy between the large negative ef- 
fect for single-voter households and the moderately 
positive effect for two-voter households. In contrast, 
the estimates based on matching show smaller gaps 
between the treatment effects for the two types of 
households. 

I also perform one-to-one matching to examine 
whether it produces different estimates. One-to-one 


DB These findings were given to me after I sent Don Green the initial 
vernon of this article. 
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matching is not as efficient as one-to-many matching 
because a smaller subset of the data is used. However, 
it may be less biased since many of the selected control 
units can be exactly matched. If the results based on 
one-to-many matching are significantly different from 
those of one-to-one matching, therefore, we may con- 
clude that the former suffers from a large bias.?* In the 
case of Gerber and Green’s data, one-to-one matching 
gives similar results. In particular, get-out-the-vote calls 
increase turnout by five and six percentage points on 
average, respectively, for the original and the revised 
data. Together with the model specification tests of 
the previous section, these sensitivity analyses indicate 
that there is minimal bias in the matching estimates 
reported in Table 9. 


CONCLUDING REMARKS 


Although Gerber and Green correctly argue that field 
experiments have many advantages over observational 
studies, they are incorrect to claim that field ex- 
periments only require “rudimentary data analysis.” 
Rather, statistical methods are essential for the anal- 
ysis of field experiments. Unlike laboratory experi- 
ments, field experiments are performed in real world 
settings. This tremendous advantage of field experi- 
ments is, however, accompanied by complications that 
commonly arise in the real world. While some of these 
complications can be avoided by a better experimental 
design and more careful implementation, other prob- 
lems will normally need to be addressed when conduct- 
ing the data analysis. 

The approach recommended in this article detected 
the implementation errors of Gerber and Green’s ex- 
periment. In light of the fact that even this prominent 
field experiment encountered such problems, it is advis- 
able to apply comprehensive diagnostic analysis such 
as the methods suggested in this article to all data 
generated by field experiments. More than 60 years 
ago, Ronald A. Fisher (1938), who introduced the con- 
cept of randomized experiments, stated, “To call in 
the statistician after the experiment is done may be 
no more than asking him to perform a postmortem 
examination: he may be able to say what the exper- 
iment died of” Since then, the field of statistics has 
made methodological advancements for the analysis of 
quasi-experimental and non-experimental data. These 
statistical methods can not only find the problems, but 
also make necessary adjustments for flawed implemen- 
tation of a field experiment. 

The reanalysis of Gerber and Green’s field experi- 
ment shows that get-out-the-vote calls increase turnout 
rather than decrease it. Along with phone calls, post- 
cards also appear to mobilize voters at relatively low 
cost. After their analysis, Gerber and Green (2000, 662) 
reached the rather pessimistic conclusion that “The 
question is whether the long-term decay of civic and 
political organizations has reached such a point that 


% I report the results for matching without replacement, but the sen- 
sitivity analysis using matching with replacement produced similar 
results 


our society no longer has the infrastructure to conduct 
face-to-face canvassing on a large scale.” In contrast, 
my findings allow greater optimism for how to reinvig- 
orate democracy. A simple phone call or postcard can 
make a difference. 

Gerber and Green’s study was one of the first large- 
scale field experiments conducted in the discipline in 
more than half a century. As more experience with field 
experiments accumulates, political scientists will learn 
how to use this promising methodology even more ef- 
fectively. Nonetheless, there will always be unforeseen 
complications in the field. The real world is a messy 
place, and only with statistical methods continuously 
adapted to the problem at hand are we able to make 
valid causal inferences. 
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Correction to Gerber and Green (2000), Replication of Disputed 


Findings, and Reply to Imai (2005) 


ALAN S. GERBER and DONALD P. GREEN Yale University 


is essay corrects the results reported in Gerber and Green 2000 and replies to Imai (2005). 
When data-processing errors are repaired, the original substantive findings from the New Haven 
experiment remain unchanged. As previously reported, brief phone calls do not increase voter 
turnout. The “correction” that Imai (2005) offers, which purports to show that these phone calls produce 
large, significant, and robust increases in voter turnout, is shown to contain statistical, computational, and 
reporting errors that invalidate its conclusions about the effectiveness and cost-effectiveness of phone 
calls and mail A replication of the New Haven experiment reproduces both the findings reported in 
Gerber and Green 2000 and the biases of Imai’s statistical analysis. 


e are grateful to Kosuke Imai, who, in the 
course of replicating and extending our exper- 
imental analysis (Gerber and Green 2000), 
brought to our attention data-processing problems as- 
sociated with our 1998 experiments on voter mobi- 
lization. However, the “correction” of our findings 
reported in Imai 2005 contains numerous statistical, 
computational, and reporting errors. The purpose of 
this essay is twofold: to offer a correction of our original 
findings and to demonstrate that the statistical analysis 
conducted by Professor Imai is systematically biased in 
favor of finding spurious effects. 

The following section presents the corrected esti- 
mates from Gerber and Green 2000. The new version 
of the New Haven dataset shows no signs of random- 
ization failure and generates statistical results that are 
substantively indistinguishable from what was reported 
in Gerber and Green 2000. Face-to-face visits increased 
turnout substantially; direct mail, modestly. Brief paid 
phone calls were ineffective. The section concludes 
with a replication of the New Haven study involving 
more than a million subjects. The results again show 
phone calls to be ineffective. The 6.5—percentage point 
effect claimed by Professor Imai (2005) is rejected at a 
significance level of less than one in a trillion. 

The subsequent section explains why Professor 
Imai’s statistical approach exaggerates the effective- 
ness of phone calls. We describe the faulty assump- 
tions underlying his model and the nonstandard way 
in which he implemented propensity score matching. 
Replication of Professor Imai’s (2005) analysis reveals 
a series of errors that call into question the magnitude, 
significance, and robustness of his matching estimates. 
This section concludes by showing that the biases of 
Professor Imai’s matching analysis are not specific to 





Alan S. Gerber is Professor, Institution for Social and Policy Studies, 
TI Prospect Street, PO. Box 208209, Yale University, New Haven, 
CT 06520-8209 (alan.gerber@yale.edu). 

Donald P. Green is Professor, Institution for Social and Policy 
Studies, 77 Prospect Street, P.O. Box 208209, Yale University, New 
Haven, CT 06520-8209 (donald.green@yale.edu). 

The authors are deeply mdebted to Kevin Arceneaux, who helped 
with every facet of this project. They also wish to thank Jack Citrin, 
Mo Fiorina, John Geer, John Krasno, Jeff Lewis, Doug Rivers, Jas 
Sekhon, Jeff Smith, Petra Todd, and three anonymous reviewers 
who commented on earlier drafts, and Robert Gentleman and John 
Immel, for their technical help. 


the New Haven study; they can be reproduced using 
other experimental data. 

Professor Imai raises a number of interesting and im- 
portant methodological topics, ranging from research 
design to data analysis. However, this essay alerts 
readers to several instances in which Professor Imai’s 
methodological advice is ungrounded in statistical the- 
ory and deviates from standard practice. The random- 
ization check that he proposes produces incorrect p 
values and does not properly detect problems in the 
random assignment of experimental groups. His as- 
surance that propensity score matching reliably esti- 
mates treatment effects is based on a dated and selec- 
tive reading of the relevant literature, and at no point 
does he establish, analytically or empirically, the su- 
periority of matching over other estimation methods, 
either in general or in this application. We show also 
that Professor Imai’s advice about how to compare 
the cost-effectiveness of alternative get-out-the-vote 
tactics rests on an algebra mistake; using the correct 
formula confirms our original conclusions. His criti- 
cism of factorial design is groundless, both empirically 
and theoretically. His essay repeatedly points out the 
importance of assorted interactions and nonlinearities 
yet neglects to subject any of them to significance tests; 
doing so reveals that they are all insignificant. In sum, 
not one of the key substantive or methodological claims 
of Professor Imai’s essay survives scrutiny. 


CORRECTION OF PUBLISHED RESULTS 


Returning to the original data files, we reconstructed 
both the household-level and the individual-level New 
Haven study for purposes of preparing this correc- 
tion. We corrected errors associated with (1) imper- 
fect matches among names returned by the phone 
bank, names returned by canvassers, and names on 
the original master file, (2) a miscommunication be- 
tween us and the phone bank about which treatment 
groups were to be assigned a get-out-the-vote ap- 
peal, and (3) data manipulation errors that caused 
some people in the control group to be incorrectly la- 
beled as treatment subjects.! The reconstructed dataset 


1 The datasets that we prepared for Professor Imai in June 2002 
addressed problems 2 and 3 The correction to problem 3 accounts 
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includes all of the observations in the original as- 
signment, including those not found on the cross-off 
sheets supplied by the registrar of voters after the 
1998 election. The corrected data file may be found 
at http://research.yale.edu/vote/NewHaven1998.html, 
and it allows the option of analyzing all 31,098 obser- 
vations or the 29,811 observations with valid 1998 vote 
data. Since missing data are uncorrelated with the orig- 
inal random assignment, the decision of how to treat 
missing observations has trivial effects on the results, 
as demonstrated below. In this section, we analyze the 
corrected dataset; in the next section, when replicating 
Professor Imai’s matching analyses, we use the earlier 
versions of the dataset that he analyzed. 


Randomization Check 


Before analyzing the data, we first demonstrate that 
nothing is amiss in the random assignment of sub- 
jects to treatment groups. Since the data were random- 
ized at the level of the household, this randomization 
check must also be performed at the household level 
(N = 23,450). Performing the randomization check 
at the individual level, as Professor Imai does (his 
Table 6), provides grossly misleading results because 
individuals living in the same household share the 
same background characteristics, such as voting ward. 
Monte Carlo simulations reported by Arceneaux, 
Gerber, and Green (2004) demonstrate that even when 
the household-level assignments to mail, phone calls, 
and visits are generated at random, estimating the ran- 
domization check at the individual level incorrectly 
declares one or more of these randomizations flawed 
in more than 95% of the simulated experiments. Pro- 
fessor Imai cites McCullagh and Nelder (1989, 119) as 
an authority for his test, but they warn (118, assump- 
tion 1) that their result hinges on the assumption that 
the observations are independent. Due to this error, all 
of the p-values reported in his Table 6 are incorrect. 
Using logistic regression, we regressed each ex- 
perimental assignment—mail, phone, and face-to-face 
canvassing—on all of the background information 
available for each household: the ward of residence, 
the number of voters in the household, the age of each 
voter, a dummy variable for missing age information, 
dummies indicating whether each voter abstained in 
1996, dummies indicating whether each voter voted in 
1996 (the omitted category was not being on the voter 
rolls in 1996), and dummies indicating whether each 
voter was registered with one of the two major parties. 
This exercise generates chi-square statistics with 39 de- 
grees of freedom. We do not follow Professor Imai’s 


for the change ın the number of treatment group observations from 
958 to 805 in his Tables 1 and 2 and explains the slight increase in 
the contact rate reported ın bis Table 4. Problem 1 did not become 
apparent until a year later, when, with the assistance of a database 

er, we reconstructed the dataset from the four orginal 
data files (the master file of treatment assignments, the names and 
contact information reported by the phone bank, vote histories, and 
results from the door-to-door canvass). This proceas also corrected 
errors m the party and age variables 
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procedure of including more than 100 first-order inter- 
actions among these variables? Monte Carlo simula- 
tion shows that even when applied to household-level 
data, his method of saturating the model with interac- 
tions leads to rejection of the null hypothesis of random 
assignment at more than twice the rate implied by the 
critical chi-square value used in his test (Arceneaux, 
Gerber, and Green 2004). This error further distorts 
the results presented in his Table 6. 

Applying the proper randomization check to these 
data reveals that the p-values from these regressions 
are nonsignificant: .95 for mail, .78 for phone, and .39 
for face-to-face canvassing. A superior way to assess 
randomization is to analyze assignment to all eight 
experimental permutations simultaneously. A multi- 
nomial logistic regression with 273 degrees of free- 
dom produces a nonsignificant x? of 261 (p = .70). 
(Employing Professor Imai’s method of including all 
first-order interactions raises the number of degrees 
of freedom to 1,764 and decreases the p-value to .27.) 
After deleting households that did not have valid vot- 
ing data in 1998, we obtain similar results (N = 22,449); 
the p-value for mail is .93; phone, .76; and face-to-face 
canvassing, .43. The multinomial regression produces 
a p-value of .60.° In sum, households were randomly 
assigned to experimental groups, and the randomiza- 
tion checks reveal no unexpected correlations between 
random assignment and voters’ background character- 
IStics. 

Professor Imai (2005, 290) concedes that two-stage 
least-squares (2SLS) is the best estimator for gauging 
treatment effects when the treatments are assigned at 
random. This estimator is the multivariate generaliza- 
tion of the instrumental variables estimator proposed 
by Angrist, Imbens, and Rubin (1996). The vote is 
regressed on actual treatment using treatment assign- 
ment as an instrumental variable (see Statistical Im- 
plications of Failure to Treat, below). The next section 
applies this estimator to the corrected experimental 
data. 


The Treatment Effects Are Substantively 
Identical to Those Reported In Gerber 
and Green 2000 


Using the corrected individual-level data, Table 1 re- 
ports the 2SLS regression estimates. Looking first at 
the model without covariates, we see that the estimated 
effect of face-to-face canvassing changes from 8.7 (pub- 
lished) to 8.4 (corrected). Mail’s effect changes from 
0.58 (published) to 0.51 (corrected). The coefficient 
estimate for phone calls remains negative. The pub- 
lished value of —4.7 is corrected to be —1.9. A similar 


2 In his footnote 18, Professor Imai (2003) advocates the use of even 
more interactions and higher-order terms than he was able to im- 
plement in this apphcation. For an example of a properly conducted 
randomrration check, see Ansolabehere and Iyengar 1995, 172. 

3 Nonsignificant p-values are also obtained for all the subsamples 
that Professor Imai analyzes. p = 54 when excluding households 
that received multiple treatments and p = 66 when further excluding 
households that received a placebo phone call asking for a blood 
donation. 
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TABLE 1. Published and Corrected Regression Results 


Without Covariates 


Personal contact 


Direct mailings (0 to 3) 


Telephone contact 


N 


Note. Dependent vanabte 1s voter tumout In 1998. Entnes are 2516 estimates, with elandard orrore in parenthoase. 

past voting In 1996, registraton as a Democrat or Republican, age, age squared, number of registered voters living In the household, 
and dummy variables marking the ward of residence. Standard errors for the corrected estimates take into account the clustering 
of Individuals within two-voter households. The first-stage equations include vanables indicating Intent to treat with personal contact, 
direct mall, and tekephone. The first-stage equation also inctudes covariatee, when these are used in the second-stage equatron. The 
second-stage equation includes the number of mailings sent, a dummy vanable mdicatng whether any member of the household was 
contacted face-to-face, and a dummy vanable Indicating whether any member of the household was contacted by phone. Both the 
published and the corrected regressions delete obeervations with missing data on voter turnout mn 1998 as well as obeervations with 
missing data for age Note that cases with missing data for age are included in Table A1. The version of the data analyzed by Imal 
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tes estimates that are almost identical to the correct coefficsents in this table, for phone calls, the estimated treatment 


(2005) genera 
effect ls —2.0 without covanates and —2.2 with covanates. 


pattern holds when we estimate the treatment effects 
controlling for voting history, age, party registration, 
and ward of residence. Again, the published estimates 
change from 9.8 to 9.0 (face-to-face canvassing), from 
0.63 to 0.59 (mail), and from —3.5 to —1.7 (phone). 
The inclusion of covariates reduces the standard errors 
slightly but otherwise has little effect on the results. 

Correcting the coding errors in the New Haven study 
changes none of the substantive conclusions that flow 
from it. Door-to-door canvassing’s effects are large and 
statistically significant using a one-tailed test. Mail’s 
effects are small and borderline significant using a 
one-tailed test. Phone’s estimates are negative and, of 
course, not significantly greater than zero using a one- 
tailed test. 

The results do not change when one discards the 
experiment’s factorial design. Professor Imai sharply 
criticizes factorial designs, referring to them in the 
text and tables as “incorrect.” (This usage has the un- 
fortunate consequence of creating confusion between 
our data-processing errors and the supposed “error” 
of assigning observations to multiple treatments.) His 
approach is to throw out all observations that were 
assigned to more than one type of voter mobilization 
treatment. We have more to say about this practice 
below. For the moment, note that the estimated effects 
of the phone treatment do not change substantively 
when one restricts the sample to this subset of obser- 
vations, although the standard errors prow consider- 
ably because the N in the phone treatment group (ex- 
cluding those assigned to the placebo condition) drops 
from 6,562 to 815. The phone treatment effect becomes 
—2.0 (SE = 6.2); with covariates, the estimate is —5.0 
(SE = 5.3). Professor Imai leaves the impression that 
the negative estimates produced by the New Haven 
study reflect randomization failures or the misguided 
use of a factorial design. It is now clear that the neg- 
ative estimates have nothing to do with randomization 





failure or use of multiple treatments. Table A1 (Ap- 
pendix) shows that, regardless of sample definition or 
model specification, all of the models produce negative 
estimates. Some estimates are more negative than the 
results reported in Gerber and Green 2000. 

What substantive inference do we draw from these 
negative estimates? Notwithstanding the title Profes- 
sor Imai selected for his essay, at no point did we assert 
that get-out-the-vote phone calls reduce turnout. What 
we said (Gerber and Green 2000, 660) was, “Given our 
initial expectation that telephoning increases turnout, 
we take this to mean that the null hypothesis of no 
effect cannot be rejected using a one-tailed test.”* On 
the first page of his essay, Professor Imai (2005) reports 
our conclusion this way: 


Gerber and Green (2000, 660) describe the negative effect 
of get-out-the-vote calls as “one of the most surprising 
results to emerge from our experiment.” 


But the passage that he quotes actually reads, “One of 
the most surprising results to emerge from our experi- 
ment is the ineffectiveness of telephone appeals.” 


Replication Confirms Original Results 


We also conducted a replication study of sufficient size 
to potentially refute our earlier conclusions. This study 


4 Professor Imai relegates a fragment of this quotation to a footnote 
and inserts into his text a quotation from a paper on a different 
experiment published in another journal (Gerber and Green 2001) 
He quotes from a footnote in the 2001 article, where, after findmg 
ERABE negative coefficients, we offer a speculation about why 
the commercial phone bank “conceivably” could have had a negative 
effect. The text of Gerber and Green 2001 (cf 76, 80, 81, 82) leaves 
no ambiguity about our mterpretation. We observed that “phone 
canvassing did not seem to affect voter turnout m our experiment” 
and that “get-out-the-vote calls produced no statistically significant 


increase in votmg rates.” 
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was designed to be above complaints about factorial 
designs or the inclusion of multiple individuals from 
the same household. Two national phone banks read 
a script much like the ones used in the 1998 study. 
Note that this was a nonpartisan script, as distinct 
from the partisan appeals that SA Imai (2005, 
283) alludes to when bolstering the intuition that 
phone calls must work or else parties would not waste 
money on them. As in 1998, this script was read dur- 
ing the weekend leading up to the federal midterm 
elections. 

The populations used for this study were registered 
voters in Iowa and Michigan. These states were chosen 
because they maintain high-quality computerized voter 
files with the vote history of each registered voter. We 
use this information later, when we apply matchıng 
to these data. The congressional districts of each state 
were divided into “competitive” and “uncompetitive” 
strata. Within each stratum, households containing one 
or two registered voters were randomly assigned to 
treatment and control groups.° Only one type of treat- 
ment was used: a get-out-the-vote phone call. Just one 
representative from each household was assigned to 
treatment or control; the other voter was ignored for 
purposes of calling and statistical analysis. Because 
only one member of each household was treated, no 
complications arise due to correlation within house- 
holds. A total of 60,000 individuals were assigned to 
be called; the corresponding control group contains 
1,846,885 individuals. At the time of this writing, voter 
turnout results for the November 2002 elections remain 
unavailable in two small Michigan counties. Eliminat- 
ing those observations reduces the treatment group to 
59,972 and the control group to 1,845,348. 

Given the enormous sample size, the experiment 
can easily detect the 6.5-percentage point effect that 
Professor Imai reports in his Table 9. The 2002 results, 
however, closely resemble our 1998 findings. The effect 
of the treatment-on-the-treated, controlling for the de- 
sign strata (competitiveness levels within each state), is 
0.4 percentage point.” Due to the massive N, the stan- 
dard error of this estimate is just 0.5, which means that 
the 99% confidence region extends from —0.9 to 1.7. 
The 6.5-percentage point effect generated by Professor 


5 The script read: “Hello, may I speak with [name of person] please? 
Hi. This is [caller’s name] calling from Vote 2002, a non-partisan 
effort working to encourage citizens to vote We just wanted to re- 
mind you that elections are being held this Tuesday The success of 
our democracy depends on whether we exercise our right to vote or 
not, so we hope you'll come out and vote this Tuesday. Can I count 
on you to vote next Tuesday?” 

6 As a randomization check, we used logistic regression to predict 
treatment based on vote ın 2000, age, number of registered voters in 
a household, and state house district. As expected, the chi-squares 
for each stratum are nonsignificant: Iowa noncompetitive (df= 24, 
p= 49), Iowa competitive (df= 63, p = 72), Michigan noncompeti- 
trve (df = 95, p =.60), and Michigan competitive (df=31, p = 23). 

7 The effects of phone banks are mmilar across states (Iowa, 
b=0 6, SE =0 6, Michigan, b=0.1,SE=0.8), competitiveness strata 
(uncompetitive districts, b=1.0, SE=07; competitive districts, 
b=—0 3, SE=07), and commercial phone ‘bank firms (first phone 
bank, b=0 6, SE =0 9, second phone bank, b=0 2, SE =0 6) The 
results remain unchanged when controls are introduced for past vot- 
ing behavior and age (b=0 4, SE=0.4). 
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Imai lies more than 12 standard errors away from the 
apparent parameter value. If the true parameter value 
were 6.5 percentage points, the eles i obtaining 
this experimental finding is less than 107 

The experiments just summarized do not mean that 
all forms of phone canvassing are doomed to fail. As 
we stated (Gerber and Green 2000, 661), these findings 
demonstrate only that brief, mechanically delivered re- 
minders to vote conducted by commercial phone banks 
have negligible effects on voter turnout. In subsequent 
research, we studied alternative phoning strategies to 
see how phone banking might be conducted more ef- 
fectively. In Green and Gerber 2001, we drew the dis- 
tinction between the commercial phone banks studied 
in Gerber and Green 2000, 2001 and volunteer phone 
banks delivering chatty, informal scripts. Ignoring this 
distinction (which is spelled out explicitly in the second 
paragraph), Professor Imai (2005, 298) cites this report 
as evidence that his “correction” squares with other re- 
sults showing that phone banks increase turnout signif- 
icantly. The thesis of Gerber and Green 2000 is not that 
door-to-door canvassing invariably works while phone 
calls invariably fail; it is that the effectiveness of alter- 
native forms of voter mobilization increases when mes- 
sages are delivered in an authentic, personal manner. 


Cost Accounting. Professor Imai’s central substan- 
tive claim is that we have overestimated the cost- 
effectiveness of personal visits vis-à-vis phone calls and 
direct mail. The experimental studies cited above show 
that calls from commercial phone banks have negligible 
effects on voter turnout. If four votes are produced for 
each 1000 people who are contacted by a commercial 
phone bank and if one pays 50 cents per contacted 
person, each additional vote costs $125. Commercial 
phone banks delivering brief nonpartisan get-out-the- 
vote calls are clearly not a cost-effective way to increase 
voter turnout (see Cardy 2005 and McNulty 2005 for 
similar findings regarding partisan calls from commer- 
cial phone banks). Let us now turn to Professor Imai’s 
further claims concerning direct mail. 

Professor Imai (2005, 290) contends that we have 
“underestimated the effectiveness of sending postcards 
by incorrectly comparing the estimated ITT [intent-to- 
treat] effects for postcards with the estimated treat- 
ment effects for visits.”8 Our mistaken conclusion, he 
asserts (289), is based on our alleged “assumption that 
all voters who were sent postcards actually received 
and read them.” Such an assumption is not warranted 
because many cards may not have reached a voter due 
to changes of address or may have been discarded as 
“junk mail.” We now review our calculations to demon- 
strate that they were correct and that Professor Imai’s 
claim is based on an accounting fallacy. 

To calculate the cost per vote of a mailing regimen, 
we reasoned as follows (Gerber and Green 2000, 661): 
“At fifty cents per mailer, sending three mailings to 
each household (containing an average of 1.5 voters) 
nets roughly one additional voter per $40 spent.” This 
figure was calculated based on our two-stage probit 


8 See below for a discussion of ITT effects 
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estimates using the following formula: 


$1.50 
1.5 -[(—0.138 + 3-0.0214) — 0(—0.138)] 
$1.50 


Using propensity score matching, Professor Imai re- 
ports in his Table 9 that three mailings increase 
turnout by 1.5 percentage points (SE=1.0). This 
statistically insignificant estimate implies a cost-per- 
vote estimate that is even higher than ours: $1.50/ 
(1.5 x 0.015) = $66.67. 

Next, we take up the charge that our comparison of 
the relative effectiveness of mail and visits is miscalcu- 
lated. Here is how we calculated the cost-effectiveness 
of visits (Gerber and Green 2000, 661): “Similar calcu- 
lations, using $1.50 as the cost per personal contact (10 
contacts per hour at $15 per hour), produce an estimate 
of approximately one more voter for each $8 spent.” 
Expressed algebraically, that calculation looks like this: 


$15/10 
1.5 -[b(—0.138 + 0.323) — b(—0.138)] 


$1.50 
aS 1.5-0.128 = $7.81. 


Professor Imai’s matching estimate of 9.2 percentage 
points (see his Table 9) generates a cost-per-vote esti- 
mate of $10.87 for personal visits. Either way, personal 
visits are more cost-effective than direct mail. 

These calculations produce the same answers if we 
use [TT estimates and rescale the calculation to elim- 
inate contact rates. Let C, be the cost per atternpted 
treatment, and let 7, be the number of votes produced 
per attempted treatment. The cost per vote of this in- 
tervention may be calculated as C,/T7,. If, instead, we 
define C, to be the cost of actually contacting someone, 
C, is equal to C, (the cost of attempting to treat) di- 
vided by «a (the proportion of people in the treatment 
group who are actually treated). The number of votes 
produced per actual treatment is 7,, which equals 7, /a, 
since the treatment-on-treated effect is the intent-to- 
treat effect divided by the contact rate. It follows, there- 
fore, that cost per vote equals 

cast per treatment CG “tale -G 


votes per treatment T. Taja T, 





_ cost per attempted treatment 
~ votes per attempted treatment’ 


This equation refutes Professor Imai’s (2005, 290) as- 
sertion that it is “inconsistent” to calculate cost per 
vote using treatment effects for visits and ITT effects 
for mail. Suppose we take Professor Imai’s advice to 
allow for the fact that some people do not read their 
mail. That means dividing both the numerator and the 
denominator by a, which leaves the estimated cost- 
efficiency unchanged.’ In sum, Professor Imai’s ar- 


> For example, rf one believes that only one-fourth of those who were 
sent mail read it, the treatment-on-treated effect for mail would 
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gument about cost-efficiency reflects confusion about 
how to manipulate the treatment and ITT effects. 
When this confusion is cleared up, the cost-efficiency 
of personal visits remains superior to direct mail, re- 
gardless of whether one uses his estimates or ours. 


WHY DOES PROPENSITY SCORE 
MATCHING PRODUCE INACCURATE 
ESTIMATES? 


The experimental evidence leaves little doubt about 
whether brief nonpartisan phone calls from telemar- 
keting firms are effective. The remaining question is 
why Professor Imai’s matching analysis produced inac- 
curate estimates. This section traces the bias in Profes- 
sor Imai’s estimator to two sources, the assumptions un- 
derlying the method and the problematic way in which 
it was implemented and reported. First, we model the 
selection problem that this matching procedure pur- 
ports to solve. Second, we note several instances in 
which Professor Imai exaggerates the inadequacies of 
instrumental variables estimation as a method for an- 
alyzing experimental data. Third, we show that the 
matching estimator that he offers as an alternative re- 
lies on much stronger substantive assumptions. Fourth, 
a review of the matching literature reveals that the gen- 
eral properties Professor Imai ascribes to this method 
are not supported by the empirical track record of this 
technique. Fifth, we show that the evidence that Pro- 
fessor Imai marshals to demonstrate the significance 
and robustness of his estimates is marred by compu- 
tational errors and discrepancies between the results 
of his computer programs and what he reports. When 
these defects are corrected, it turns out that his method 
fails both of the specification tests that he proposes as 
ways of detecting bias in propensity score matching, as 
well as two further specification tests that follow from 
the logic of his model. Finally, we demonstrate that 
the biases of Professor Imai’s estimator are not specific 
to the New Haven study. Applying matching to the 
replication experiments conducted in 2002 shows that 
this estimator again severely overestimates the effects 
of brief nonpartisan phone calls. 


Statistical Implications of Fallure to Treat 


When estimating experimental treatment effects, an- 
alysts must attend to complications that arise when 
some of the people who are assigned to the treatment 
are not actually treated. In the case of the New Haven 
experiment, for example, 32% of the 5,275 households 
assigned to receive get-out-the-vote phone calls in fact 
received them. The remaining 68% had unknown or 
nonworking telephone numbers, never answered the 
phone, or hung up on callers before they could read 
their scripts. Failure to treat a portion of the assigned 


be four times greater than the ITT effect. However, the cost per 
actual treatment would also be four trmes higher than the cost per 
attempted treatment. 
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treatment group creates a selection problem: if those 
reached by phone have a greater propensity to vote 
than those who were not, a comparison of the treated 
group (those actually reached by phone) and the con- 
trol group will overstate the effects of phone calls. Even 
if the phone calls truly have no effect on turnout, they 
may appear to have large effects. 

In Gerber and Green 2000, we presented a model of 
the selection process and used it to derive an estimator 
of the treatment effect that is consistent even in the 
presence of this selection problem. Suppose that for a 
given canvassing effort, the population can be divided 
into two groups, those who are reachable and those 
who are not. Let a be the proportion of the popula- 
tion that is reachable. Let p,, be the probability that a 
nonreachable person votes. Let p, be the probability 
that a reachable person votes in the absence of an ex- 
perimental treatment, and let p, + t be the probability 
that a reachable person votes after exposure to the 
experimental treatment. 

In the control group, we do not observe who is reach- 
able and who is not. What we observe is the voting rate 
for the group as a whole. The expected voting rate in 
the control group (Pc) is 


Po =ap,+(1—a@)ppr. (1) 


When treatment and control groups are formed ran- 
domly, both groups have the same expected propor- 
tions of reachable and nonreachable people. In this 
case, the expected voting rate in the treatment group is 


Pr=a(p,+t)+(—-a)pp. (2) 


Manipulating these equations and solving for t suggests 
the instrumental variables (IV) estimator for the treat- 


ment effect: 
2 (3) 


The numerator of this equation is the estimated ITT 
effect: it is simply the difference between the voting 
rate in the assigned treatment group (the subjects the 
experimenter intended to treat) and the voting rate 
in the control group. The denominator contains the 
“contact rate”: the fraction of people in the treatment 
group who were actually treated. Both quantities are 
easily calculated from experimental data. Because the 
contact rate is positive, ITT and ¢ always share the same 
sien. We show below that Professor Imai’s matching 
analysis does not satisfy this logical requirement. When 
analyzing the effects of phone calls, matching generates 
a negative estimate of the ITT but a positive estimate 
of the treatment effect. 

Professor Imai concedes that when experimental 
groups are assigned randomly, IV is the appropriate 
estimator of the treatment-on-treated effect (often re- 
ferred to as simply the “treatment effect”). He argues 


10 The parameter t can vary across those contacted, in which case t 
ts interpreted as the average treatment effect among those actually 
treated. 
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that when groups are not assigned randomly, the IV es- 
timator generates biased results. This argument is moot 
with respect to the experimental evidence presented 
above; the updated version of the New Haven dataset, 
as well as the Iowa and Michigan replication studies, 
shows no signs of randomization problems. ‘This is not 
the only concern that Professor Imai raises about IV, 
however. To justify matching, Professor Imai advances 
a number of additional criticisms of IV estimation. 
These criticisms rest on three flawed arguments. 


i. The Small Sample Bias of the IV Estimator Is 
Irrelevant in This Application. Professor Imai 
(2005) writes that it is “important to note” the fi- 
nite sample bias of IV estimation and claims that the 
small size of our treatment groups “suggests the impor- 
tance of finite sample considerations” (his footnote 18). 
However, the “importance” of small sample bias in our 
particular case is merely an assertion. It is easy to assess 
the small sample bias, under various contact rates, using 
simulation methods. Doing so reveals Professor Imai’s 
concerns to be groundless (for Monte Carlo results, see 
Gerber and Green 2004b). 


li. IV Regression Is Not “Inefficient.” In his foot- 
note 18, Professor Imai contends that IV is “inefficient” 
and cites Bound, Jaeger, and Baker (1995) for support. 
Bound et al. make no such claim. This appears to be 
one of several instances where the term of art “inef- 
ficient” is misused as a synonym for “large standard 
error.” Professor Imai attempts to bolster this theoret- 
ical claim with evidence, repeatedly assailing the IV 
estimator on the grounds that its estimates are some- 
times positive and sometimes negative when applied 
to the New Haven study. This line of argumentation 
conflates the properties of estimators (algorithms) with 
the behavior of estimates (statistics obtained from a 
particular sample). Professor Imai fails to explain that 
the apparent volatility of his IV estimates stems not 
from the properties of the IV estimator but rather from 
the simple fact that he restricts his attention to small 
subsamples of the New Haven data. To produce the 
astonishing —27 estimate that he twice mentions on the 
first page of his article, Professor Imai takes the subjects 
who were assigned to the phone treatment, discards 
those who were also assigned to receive mail or visits, 
and then discards those who reside in households with 
two registered voters. That leaves 7% of the original 
treatment group. It is hardly surprising that dramatic 
reductions in sample size increase the volatility of the 
IV estimates. 


ili. Randomization Failure Does Not Necessarily 
Lead to Bias in IV Estimation. In the model pre- 
sented above, randomization ensures that both the 
treatment and the control groups have the same ex- 
pected proportions (a) of reachable and nonreachable 
people. In any given dataset, these proportions may dif- 
fer due to sampling error. Professor Imai’s discussion 
of randomization frequently confuses the properties of 
randomization as a procedure with the outcome that re- 
sults from a given random assignment. Randomization 
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is an unbiased procedure even if in a particular exper- 
iment it produces a treatment group with too many 
young people or too many people in a certain sec- 
tion of town. To the extent that randomization gen- 
erates observable differences between treatment and 
control groups, an appropriate corrective is to con- 
trol for these differences. For this reason, Gerber 
and Green 2000 reports the experimental estimates 
both with and without controls for background char- 
acteristics. This point is glossed over in the “replica- 
tion” tables presented in Imai 2005, which exclude 
covariates. 

How is the IV estimator affected when randomiza- 
tion is compromised by clerical error? Suppose that cer- 
tain subjects had a greater probability of being assigned 
to the treatment group. This failing is not a sufficient 
cause of bias. Bias results only when the experimental 
assignment is correlated with omitted causes of the 
vote. Thus, even if Professor Imai had obtained sig- 
nificant results from a properly performed randomiza- 
tion check, he would not have established that the sys- 
tematic patterns in the experimental assignment pro- 
duced biased estimates. Professor Imai attempts to fill 
this logical gap by demonstrating that the treatment 
and control groups have different propensities to vote. 
He reports that in select subsamples, there are small 
and marginally significant correlations with past voting 
(Imai 2005, 293). Professor Imai neglects to mention 
that none of the treatment groups (phone, mail, or 
visit) bears a significant relationship to past voting in 
the sample as a whole. Nor are the treatment assign- 
ments correlated with age, number of voters in the 
household, or party registration. The largest absolute 
correlation between the phone treatment and any of 
these variables is .01 (nonsignificant), and jointly they 
do not significantly predict any treatment assignment 
in a logistic regression. It is not surprising, therefore, 
that correcting the clerical errors in the original New 
Haven data had little effect on the results. 

We now turn our attention to propensity score 
matching, which Professor Imai represents as a supe- 
rior statistical method. 


Matching Theory and Implementation 


When Is The Matching Estimator Unbiased? IV 
compares the turnout rates of those assigned to the 
treatment group with the turnout rates of those as- 
signed to the control group, because these groups have 
been formed in a way that ensures their comparability. 
The point of performing randomized experiments is 
to liberate analysts from making strong assumptions 
about the unknown values of p, and p wr. 

Professor Imai’s matching analysis, by contrast, com- 
pares those who actually received the treatment to the 
control group, after controlling for background char- 
acteristics. (This type of comparison has been shown to 
produce biased estimates in fields such as biostatistics; 
see Lee, Ellenberg, Hirtz, and Nelson 1991.) Suppose 
one were to restrict the sample to subjects who share 
exactly the same background characteristics, such as 
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age, party, and voting history. For this subgroup, Imai’s 


estimator may be expressed as 
C=p,+t—Pco=p,+t—ap,—(1-a)p, 
=t+(1—a)(p; — Dar). (4) 


The last term in this equation represents the bias 
in the matching estimator. When p, = pg, this last 
term cancels, and the bias is zero. In other words, the 
matching approach will work if, after controlling for 
background characteristics, reachable and nonreach- 
able people become interchangeable. This is a very 
strong (and demonstrably false) assumption. Bear in 
mind that in the version of the New Haven data that 
Professor Imai analyzed, the voting rate among those 
reached by phone was 64.8%, compared to 44.4% 
among those in the control group. By controlling for 
background characteristics, Professor Imai whittles this 
20.4-percentage point gap down to 6.5 percentage 
points. The question is whether the background infor- 
mation that he used was sufficient to control for all of 
the unobserved factors that could account for higher 
voting rates among those who answered the phone. 
The experimental results as well as the specification 
tests presented below demonstrate that the answer is 
no. 


Are Matching Estimators Accurate in General, 
and Should We Expect Matching to Work in This Ap- 
plication? Because Professor Imai’s estimator com- 
pares those actually treated to a comparison group 
of untreated individuals, it must overcome a selection 
problem. Although he declares that “when treatment 
assignment is not completely random and important 
covariates are observed, propensity score matching is 
the best available statistical method” (295), his en- 
thusiasm is not shared even by authors of the articles 
that he cites. Heckman et al. (1998, 1019), for example, 
caution: 


In general, matching is not guaranteed to reduce bias and 
may increase it.... Moreover, matching is open to many of 
the same criticisms that have been directed against tradı- 
tional econometric estimators because the method relies 
on arbitrary assumptions. Even with the rich data at our 
disposal, the method of matching is not, in general, an 
effective evaluation method. 


Among recent essays that describe the array of choices 
available to investigators trying to measure treatment 
effects, none give pride of place to cross-sectional 
propensity score matching, the estimator that Profes- 
sor Imai endorses (e.g., Angrist and Krueger 1999, 
Heckman, LaLonde, and Smith 1999, and Smith and 
Todd 2004). Also pertinent are meta-analyses by 
Bloom et al. (2002) and Glazerman, Levy, and Myers 
(2003), which evaluate various observational estima- 
tion techniques, including matching. They report that 
matching methods often fail to produce accurate treat- 
ment effect estimates. 

Propensity score matching’s mixed record is not re- 
flected in Professor Imai’s essay. He does not cite a sin- 
gle work on propensity scores written by others in the 
past six years, with the lone exception of Dehija and 


307 


Correction, Replication, and Reply to Imai 


Wahba (1999). Professor Imai (2005) cites this article to 
bolster the claim that “matching is known to effectively 
reduce bias caused by nonrandom treatment” (295) 
but neglects to report a well-known critique of their 
work, which claims that Dehija and Wahba’s results 
were an artifact of questionable sample restrictions and 
faulty analysis (Smith and Todd 2001). 

Are there any special reasons why propensity score 
matching is particularly well suited to gauging the 
effects of voter mobilization campaigns in the New 
Haven study? Professor Imai (2005, 295) offers three 
reasons to believe so: 


First, since the large control group roughly represents the 
population, we know that potential compliers exist in this 
group. Second, as shown later, I find many exact and close 
matches, indicating that the bias due to inexact matches 
is minimal. Third, when the covariates measuring mpor- 
tant characteristics of the subjects are available, propen- 
sity score matching 1s a powerful method for reducing 
bias. 


The first argument presupposes that the background 
information that Professor Imai has at his disposal 
(age, ward of residence, party registration, past vot- 
ing behavior, and number of voters in the household) 
can be used to identify people in the control group 
who have the same probability of voting as the peo- 
ple who received phone calls. What about people who 
have moved? What about people with unlisted phone 
numbers? What about people who are unwilling to 
talk with phone canvassers? It is doubtful that these 
background variables can be used to find appropriate 
matches for those who actually receive the treatment. 
Bear in mind that the pseudo-R of the propensity score 
model predicting who is treated is just .05. 

Professor Imai’s second argument avoids the cen- 
tral issue, which is the possibility that individuals who 
are successfully treated are different from individuals 
in the control group who share the same background 
characteristics. This argument also turns out to be mis- 
leading since the matching estimates reported in his 
Table 9 for the phone treatment are not based on exact 
matching. When estimating the effects of phone calls, 
Professor Imai matches each person who received a 
call to five people in the control group with the nearest 
propensity scores. Only a small proportion of these 
matches is exact. As we show below, the problem of 
inexact matches contributes to the bias in Professor 
Imai’s estimates, 

Professor Imai’s third argument repeats his unsup-~ 
ported assertion about the power of matching to reduce 
bias in general. The analytic and empirical literatures 
on the subject give no such assurances about the gen- 
eral properties of the method. 


Problematic Implementation of Propensity Score 
Matching. Table 2 shows that Professor Imai also ex- 
aggerates the magnitude, significance, and robustness 
of his matching estimates. We summarize four errors 
in Professor Imai’s text and tables; interested readers 
may replicate these results for themselves using the 
replication programs we posted. 
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Error I: Using Bootstrapping to Provide Point Es- 
timates, as Opposed to Sampling Variances. Boot- 
strapping is “a tool for developing inferential statis- 
tics (i.e., confidence intervals and bias estimators), not 
point estimators of parameters” (Mooney and Duval 
1993, 60). The point estimates that Professor Imai 
reports in his Table 9, however, were obtained by 
drawing N observations with replacement from the 
actual sample and then averaging over 500 replica- 
tions of this procedure. This bootstrapping procedure 
generates distorted results. Table 2 compares boot- 
strapped estimates with estimates from the actual sam- 
ple. The estimated effects of phone calls fall to insignif- 
icance when based on the actual sample. His match- 
ing program generates an estimate of 5.1 (SE=3.3) 
when performing one-to-five matching on the ac- 
tual sample, and this number drops to 3.6 (SE=4.5) 
when one-to-five matching is performed with replace- 
ment. 


Error 2: The Reported Bootstrapped Estimates in Pro- 
fessor Imai’s Table 9 Are Outliers. Table 9 of Imai 
2005 exaggerates the significance of the phone effect 
by reporting a coefficient of 6.5 with a standard error 
of 3.2. These numbers imply a f-ratio of 2.03, which 
tells the reader at a glance that the effect is statisti- 
cally significant using a two-sided test (p < .05). When 
these numbers are replicated using 500,000 bootstrap 
samples rather than 500, the correct estimate is 6.2, 
with a t-ratio of 1.87 (see Table 2). When we repeated 
Professor Imai’s procedure of 500 bootstrap samples 
1,000 times, only 2% of the time did we obtain a ¢-ratio 
as high as 2.03. 


Error 3: Imai Misreports the Results of His Two 
“Sensitivity” Tests. Professor Imai describes two sen- 
sitivity tests for his implementation of one-to-five 
matching. The first test indicates that the results do not 
depend on whether one performs one-to-five matching 
or one-to-one matching. Acknowledging that one-to- 
many matching causes the match quality to deterio- 
rate, Professor Imai (2005, 299) notes, “If the results 
based on one-to-many matching are significantly 
different from those of one-to-one matching... we may 
conclude that the former suffers from large bias.” His 
Table 9 indicates that one-to-five matching produces 
an estimate of 6.5; his text reports that one-to-one 
matching generates an estimate of “six” (299). This is 
incorrect. The one-to-one matching estimate is in fact 
4.3, with a standard error of 4.1. Moreover, as the first 
row in Table 2 indicates, there is a clear relationship 
between the number of matches and the size of the 
estimated treatment effect: the greater the number of 
matches, the larger the estimated effect. The second 
sensitivity test is found in Professor Imai’s footnote 24: 
“I report the results for matching without replace- 
ment, but the sensitivity analysis using matching with 
replacement produced similar results.” The exact num- 
bers behind these “similar results” are not reported. 
We report them in Table 2. Using Professor Imai’s 
program, we find that the bootstrap estimate for one- 
to-five matching with replacement is 4.0 (SE = 4.5). 
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TABLE 2. Replicating Professor Imal’s Matching Results Assessing the Effects of Phone Calls 


Number of Matches for Each Treated Observation 


Effect 
Treatment 


Bootstrap sample? 
Without replacement 


With replacement 


Actual sample’ 
Without replacement 


With replacement 


Intent to treat 


Bootstrap sample? 
Without replacement 


With replacement 


Actual sample’ 
Without replacement 


With replacement i 
2.6 j í 
Note. The propensity scores used for matching were generated based on the specification that Professor Imal (2005) reports In 
his footnote 21. Professor Imai verted via e-mall correepondence (September 19, 2003) that this was the spectfication used to 
generate his 1-1 and 1-5 matching estimates. 

2 imal 2002, (Table 7) reports 1—1 and 1-3 matching but not 1-5 matching Imal 2005, (Table 9) Introduces 1-5 matching and 
omits 1-1 and 1-3 matching. 

>Egtmates are averages of 25,000 bootstrapped samples. Note that using bootstrapping to obtan point estimates deviates from 
standard statistical practice but fs used here to replicate |mau's results. 

e imal (2005, 299) reports thls estimate as 6 (standard error not reported). 

dima (2005, Table 9) reports ths number as 6 5, with a standard error of 3.2. Our estimate ls based on 500,000 bootstrap 
sam 

e imal (2005, ftnt 24) describes this number as “similar” to 6 (standard error not reported). 

f This analysis does not draw bootstrap samples from the actual data. The only random element in this analysis concerns the 
breaking of tes between equally good matches. These results are based on the average estimates over 100 replicatons. The 
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standard errors are the bootstrapped standard errors reported in the rows above. 


For one-to-one matching, the figure is 3.7 (SE=4.8). 
These numbers convey a different impression of the 
treatment effect from the estimate of 6.5 (SE=3.2) 
reported in his Table 9. 


Error 4: A Computer Error Accounts for the Supposed 
Robustness of Matching Estimates. A recurrent theme 
of Professor Imai’s paper is the superior performance 
of matching when applied to subsamples of the New 
Haven data, such as those living in households with 
either one or two registered voters. His argument is 
that a good estimator produces similar results across 
subgroups.!! According to his Table 9, phone calls in- 
creased turnout by 6.9 percentage points in single-voter 
households and 6.1 percentage points in two-voter 
households. This pattern of similar estimates, however, 
occurs because Professor Imai’s computer program in- 


11 This hypothesis shifted over successive versions of Professor 
Imai’s APSR article. Professor Imai (2002, 25) earlier predicted a 
larger effect for one-voter households. The matching estimates re- 


ported in that paper supported his prediction. 





dexes household size incorrectly. The correct estimates 
are 2.1 and 10.0.4 

Two important errors of omission must also be men- 
tioned. The first concerns the negative ITT effects of 
phone calls. Logically, the effect of assigning people 
to be called must be the same sign as the effect of 
actually calling them. Although Professor Imai reports 
the ITT estimates using regression and claims that ITT 
estimates are needed for cost-effectiveness compar- 
isons, he reports the matching estimates of the ITT 
effect only for mail, not for phone calls. Instead, the 
reader is shown only the positive matching estimates 
for the phone treatment effects and is informed that the 


12 When reporting the corrected results, we have followed Profes- 
sor Imai’s practice of dividing the sample according to whether the 
treated subjects live in one- or two-person households. Professor 
Imai’s program matches these subjects to the nearest propensity 
scores in the control group, regardless of whether these control sub- 
jects reside in one- or two-person households. He does not alert 
the reader to this fact, which invalidates the companson between 
matching and IV for one- and two-voter subgroups. The computer 
error does not affect his estimates for mail. Without explanation, he 
estimated these matching models separately for one- and two-person 
households. 


Correction, Replication, and Reply to Imai 


negative estimates generated by OLS and 2SLS some- 
how reflect the deficiencies of these estimators when 
applied to flawed experimental data. ‘Table 2 shows that 
the negative ITT estimates have nothing to do with the 
use of linear, parametric estimators. Using Professor 
Imai’s matching program, we estimated the ITT effect 
of phone calls. Regardless of the number of matches 
and regardless of whether one uses the actual sample or 
bootstrap samples, propensity score matching always 
produces insignificant negative ITT estimates, ranging 
from —2.8 to —2.1, with standard errors ranging from 
1.9 to 2.6. (Bowers and Hansen 2005, analyzing the 
same data, also obtain weakly negative ITT estimates 
using matching.) The implication is that the effects 
of phone calls were statistically indistinguishable from 
Zero. 

The second omission is the failure to check whether 
the nontreated vote at the same rate as the control 
group. The key identifying assumption in Professor 
Imai’s estimator is that the people who are reached 
by phone (the “treated” group) are identical to the 
people in the control group who share background 
characteristics, This reasoning leads him to infer the 
causal effect of phone calls from the fact that these 
two groups vote at different rates. Looking only at 
Professor Imai’s results, one cannot know whether the 
observed differences between the treated group and 
the matched control group reflect a true positive effect 
of the treatment or the upward bias of the matching 
procedure (see Eq. [4| above). But the same logic also 
implies that people who were assigned to the treatment 
group but not contacted will vote at the same rate as 
those in the control group with similar background 
characteristics (see Imai 2005, Figure 1). Since there 
can be no treatment effect without a treatment, the 
difference between the untreated and their matched 
counterparts can only be attributed to the bias of the 
matching estimator. 

Using Professor Imai’s matching program to com- 
pare the noncontacted members of the treatment group 
to the control group reveals that those who could not 
be reached by phone vote at substantially lower rates, 
The estimate produced by one-to-five propensity score 
matching is —5.6, with a standard error of 2.3.% The 
hypothesis that matching produces unbiased estimates 
is therefore easily rejected at the 5% level. Note that 
the negative matching estimate implies that placing 
phone numbers on a list and not calling them depresses 
turnout. The absurdity of this proposition makes it 
a powerful diagnostic tool. It demonstrates that un- 
observed causes of voter turnout are correlated with 
whether a subject is reachable by phone. 

The remaining question is whether the pattern of 
biases detected in the New Haven data can be repro- 
duced in other studies. The answer is yes. When we 
apply matching to the enormous Iowa and Michigan 
samples, using covariates similar to those available in 


3 The ty score model used to generate this estimate uses 
all of the main effects and first-order mteractions among the back- 
ground variables. This model passes the balancing tests described by 
Professor Ima 
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the New Haven study, we obtain upwardly biased esti- 
mates of 4.2 (SE =0.4) and 3.2 (SE=0.5), respectively, 
for the effects of phone calls.\4 We also find strong 
evidence that those who were not reachable by phone 
were less likely to vote than the control group, a clear 
sien of bias. In sum, matching produces misleading es- 
timates in this application. 


COMMENTS ON INTERPRETATION 
AND RESEARCH DESIGN 


Professor Imai contends that, when properly analyzed, 
the New Haven experiment shows that phone calls 
dramatically increase voter turnout. Due to the ready 
availability of experimental data, Professor Imai’s con- 
clusion can be evaluated with rare precision. The New 
Haven experiment and other large-scale field exper- 
iments demonstrate the ineffectiveness of brief calls 
from commercial phone banks The probability that 
the treatment effect is as large as the 6.5 that Professor 
Imai presents is effectively zero. 

Having addressed Professor Imai’s main statistical 
claim, we conclude by briefly responding to three fur- 
ther criticisms that he levies against the analysis pre- 
sented in Gerber and Green 2000. These criticisms 
allege that (1) the use of factorial design in the New 
Haven experiment was “incorrect” and “inefficient,” 
(2) we failed to take notice of many important nonlin- 
earities and interactions, and (3) we resist the use of 
advanced statistical methods. 


Factorial Design 


Professor Imai (287-288) sharply criticizes the use of 
factorial design and declares the use of multiple treat- 
ments “incorrect” and “inefficient.” Asserting that “in 
principle, it is advisable to minimize the number of 
treatments in field experiments” (288), his correction to 
the New Haven study is to discard all observations as- 
signed to multiple treatments (289). We would caution 
readers that Professor Imai’s practice of discarding all 
observations assigned to multiple treatments is highly 
idiosyncratic. He cites no authorities, provides no sta- 
tistical analysis, and ignores the large and trenchant 
literature that endorses factorial design, particularly in 
the early phases of a research program (see Mead 1988, 
584-85). 

Professor Imai’s discussion of factorial design is 
predicated on a mistaken understanding of what it 
means to estimate the “marginal effect” of a treatment. 
This endeavor does not require one to compare a single 


14 The large sample sizes of these studies enable us to use exact 
matching, rather than propensity score matching (see Rosenbaum 
and Rubin 1985 on the supenority of exact matching) Because the 
treated and control observations share exactly the same values of the 
covariates, the two groups are automatically “balanced.” In Iowa, 
10,299 of the treated observations (75%) were matched to 132,577 
observations in the control group. In Michigan, 9,565 of the treated 
(85%) were matched to 167,599 observations in the control group To 
mimic the New Haven study, the control groups in Iowa and Michigan 
included randomly selected individuals without known telephone 
numbers. 
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{:. a are freer to ax preferences and that ne are mo 

likely to rsapond to these preferences; that in democracy there is more room for 

7 Sae meaningful debate and deliberation; and that under the proper conditions democratic 
decision-making will produce falr and Just social outcomes. Researchers have 


| A _ demonstrated that people who live in democrecies, on averagd, earn higher wages, are 
Cee freer to form organizations, enjoy a broader range of public services, are less likely to 
ee eae ot a go to war and to suffer from famines, and enjoy more responsive governments than do 


a j ina: Ay SS ` People who live under non-democratic regimes. Yet it is clearer today than It has been 
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